Issues with Measuring Task Complexity via Random Policies in Robotic Tasks
arXiv:2602.18856v1 Announce Type: new Abstract: Reinforcement learning (RL) has enabled major advances in fields such as robotics and natural language processing. A key challenge in RL is measuring task complexity, which is essential for creating meaningful benchmarks and designing effective...
VariBASed: Variational Bayes-Adaptive Sequential Monte-Carlo Planning for Deep Reinforcement Learning
arXiv:2602.18857v1 Announce Type: new Abstract: Optimally trading-off exploration and exploitation is the holy grail of reinforcement learning as it promises maximal data-efficiency for solving any task. Bayes-optimal agents achieve this, but obtaining the belief-state and performing planning are both typically...
Hyperbolic Busemann Neural Networks
arXiv:2602.18858v1 Announce Type: new Abstract: Hyperbolic spaces provide a natural geometry for representing hierarchical and tree-structured data due to their exponential volume growth. To leverage these benefits, neural networks require intrinsic and efficient components that operate directly in hyperbolic space....
Boosting for Vector-Valued Prediction and Conditional Density Estimation
arXiv:2602.18866v1 Announce Type: new Abstract: Despite the widespread use of boosting in structured prediction, a general theoretical understanding of aggregation beyond scalar losses remains incomplete. We study vector-valued and conditional density prediction under general divergences and identify stability conditions under...
HEHRGNN: A Unified Embedding Model for Knowledge Graphs with Hyperedges and Hyper-Relational Edges
arXiv:2602.18897v1 Announce Type: new Abstract: Knowledge Graph(KG) has gained traction as a machine-readable organization of real-world knowledge for analytics using artificial intelligence systems. Graph Neural Network(GNN), is proven to be an effective KG embedding technique that enables various downstream tasks...
PCA-VAE: Differentiable Subspace Quantization without Codebook Collapse
arXiv:2602.18904v1 Announce Type: new Abstract: Vector-quantized autoencoders deliver high-fidelity latents but suffer inherent flaws: the quantizer is non-differentiable, requires straight-through hacks, and is prone to collapse. We address these issues at the root by replacing VQ with a simple, principled,...
Court holds that U.S. Postal Service can’t be sued over intentionally misdelivered mail
A divided Supreme Court sided with the federal government on Tuesday in U.S. Postal Service v. Konan, a dispute over mishandled mail. Writing for a 5-4 majority, Justice Clarence Thomas […]The postCourt holds that U.S. Postal Service can’t be sued...
The sudden return of summary reversals
Nuts and Bolts is a recurring series by Stephen Wermiel providing insights into the mechanics of how the Supreme Court works. A Supreme Court shortcut for deciding cases without full […]The postThe sudden return of summary reversalsappeared first onSCOTUSblog.
Oral argument live blog for Monday, March 2
On Monday, March 2, we will be live blogging as the court hears argument in United States v. Hemani, on whether a federal statute that prohibits gun possession by users […]The postOral argument live blog for Monday, March 2appeared first...
Standing in and after Bost
Controlling Opinions is a recurring series by Richard Re that explores the interaction of law, ideology, and discretion at the Supreme Court. The Supreme Court’s recent decision in Bost v. […]The postStanding in and after Bostappeared first onSCOTUSblog.
SCOTUStoday for Tuesday, February 24
On this day in 1803, the Supreme Court released its ruling in Marbury v. Madison, which established the principle of judicial review (or did it?). Mark the anniversary with us […]The postSCOTUStoday for Tuesday, February 24appeared first onSCOTUSblog.
In Defense of Substantive Due Process
Introduction Originalism has a branding and substance problem.[1] If originalism is what it purports to be—impartial and value-free enforcement of the Founders’ intention and “the only approach to text that is compatible with democracy”[2]—more Americans would have faith in the...
Chill
Introduction No concept is more pervasive in the law of freedom of speech than chill.[1] The chilled speech doctrine guards against self-censorship: it permits First Amendment challenges based on the allegation that a law deters the plaintiff or others from...
In a replay of 2019, Apple says a single desktop Mac will be manufactured in the US
Apple is still working to get favorable tariff treatment from the Trump administration.
India’s AI boom pushes firms to trade near-term revenue for users
ChatGPT and rivals are testing whether India's massive AI user boom can translate into paying customers as free offers wind down.
Meta strikes up to $100B AMD chip deal as it chases ‘personal superintelligence’
Meta is buying billions of dollars in AMD AI chips in a multiyear deal tied to a 160 million-share warrant, deepening its push to diversify beyond Nvidia and expand data center capacity.
Oura launches a proprietary AI model focused on women’s health
The model supports questions spanning the full reproductive health spectrum, from early menstrual cycles through menopause.
Final 4 days to save up to $680 on your TechCrunch Disrupt 2026 pass
Just 4 days left before savings of up to $680 on your TechCrunch Disrupt 2026 pass end on February 27 at 11:59 p.m. PT. Register to save at one of the most anticipated tech events of the year.
Nimble raises $47M to give AI agents access to real-time web data
Nimble uses AI agents to search the web, verify and validate the results, and then clean and structure the information into neat tables that can then be queried like a database.
QueryPlot: Generating Geological Evidence Layers using Natural Language Queries for Mineral Exploration
arXiv:2602.17784v1 Announce Type: cross Abstract: Mineral prospectivity mapping requires synthesizing heterogeneous geological knowledge, including textual deposit models and geospatial datasets, to identify regions likely to host specific mineral deposit types. This process is traditionally manual and knowledge-intensive. We present QueryPlot,...
Deep Learning for Dermatology: An Innovative Framework for Approaching Precise Skin Cancer Detection
arXiv:2602.17797v1 Announce Type: cross Abstract: Skin cancer can be life-threatening if not diagnosed early, a prevalent yet preventable disease. Globally, skin cancer is perceived among the finest prevailing cancers and millions of people are diagnosed each year. For the allotment...
Mind the Style: Impact of Communication Style on Human-Chatbot Interaction
arXiv:2602.17850v1 Announce Type: cross Abstract: Conversational agents increasingly mediate everyday digital interactions, yet the effects of their communication style on user experience and task success remain unclear. Addressing this gap, we describe the results of a between-subject user study where...
Enhancing Scientific Literature Chatbots with Retrieval-Augmented Generation: A Performance Evaluation of Vector and Graph-Based Systems
arXiv:2602.17856v1 Announce Type: cross Abstract: This paper investigates the enhancement of scientific literature chatbots through retrieval-augmented generation (RAG), with a focus on evaluating vector- and graph-based retrieval systems. The proposed chatbot leverages both structured (graph) and unstructured (vector) databases to...
Financial time series augmentation using transformer based GAN architecture
arXiv:2602.17865v1 Announce Type: cross Abstract: Time-series forecasting is a critical task across many domains, from engineering to economics, where accurate predictions drive strategic decisions. However, applying advanced deep learning models in challenging, volatile domains like finance is difficult due to...
MantisV2: Closing the Zero-Shot Gap in Time Series Classification with Synthetic Data and Test-Time Strategies
arXiv:2602.17868v1 Announce Type: cross Abstract: Developing foundation models for time series classification is of high practical relevance, as such models can serve as universal feature extractors for diverse downstream tasks. Although early models such as Mantis have shown the promise...
Understanding Unreliability of Steering Vectors in Language Models: Geometric Predictors and the Limits of Linear Approximations
arXiv:2602.17881v1 Announce Type: cross Abstract: Steering vectors are a lightweight method for controlling language model behavior by adding a learned bias to the activations at inference time. Although effective on average, steering effect sizes vary across samples and are unreliable...
Games That Teach, Chats That Convince: Comparing Interactive and Static Formats for Persuasive Learning
arXiv:2602.17905v1 Announce Type: cross Abstract: Interactive systems such as chatbots and games are increasingly used to persuade and educate on sustainability-related topics, yet it remains unclear how different delivery formats shape learning and persuasive outcomes when content is held constant....
Condition-Gated Reasoning for Context-Dependent Biomedical Question Answering
arXiv:2602.17911v1 Announce Type: cross Abstract: Current biomedical question answering (QA) systems often assume that medical knowledge applies uniformly, yet real-world clinical reasoning is inherently conditional: nearly every decision depends on patient-specific factors such as comorbidities and contraindications. Existing benchmarks do...
MIRA: Memory-Integrated Reinforcement Learning Agent with Limited LLM Guidance
arXiv:2602.17930v1 Announce Type: cross Abstract: Reinforcement learning (RL) agents often suffer from high sample complexity in sparse or delayed reward settings due to limited prior structure. Large language models (LLMs) can provide subgoal decompositions, plausible trajectories, and abstract priors that...
Towards More Standardized AI Evaluation: From Models to Agents
arXiv:2602.18029v1 Announce Type: new Abstract: Evaluation is no longer a final checkpoint in the machine learning lifecycle. As AI systems evolve from static models to compound, tool-using agents, evaluation becomes a core control function. The question is no longer "How...