SCALAR: Learning and Composing Skills through LLM Guided Symbolic Planning and Deep RL Grounding
arXiv:2603.09036v1 Announce Type: new Abstract: LM-based agents excel when given high-level action APIs but struggle to ground language into low-level control. Prior work has LLMs generate skills or reward functions for RL, but these one-shot approaches lack feedback to correct...
Sim2Act: Robust Simulation-to-Decision Learning via Adversarial Calibration and Group-Relative Perturbation
arXiv:2603.09053v1 Announce Type: new Abstract: Simulation-to-decision learning enables safe policy training in digital environments without risking real-world deployment, and has become essential in mission-critical domains such as supply chains and industrial systems. However, simulators learned from noisy or biased real-world...
Dynamic Multi-period Experts for Online Time Series Forecasting
arXiv:2603.09062v1 Announce Type: new Abstract: Online Time Series Forecasting (OTSF) requires models to continuously adapt to concept drift. However, existing methods often treat concept drift as a monolithic phenomenon. To address this limitation, we first redefine concept drift by categorizing...
Learning Adaptive LLM Decoding
arXiv:2603.09065v1 Announce Type: new Abstract: Decoding from large language models (LLMs) typically relies on fixed sampling hyperparameters (e.g., temperature, top-p), despite substantial variation in task difficulty and uncertainty across prompts and individual decoding steps. We propose to learn adaptive decoding...
PPO-Based Hybrid Optimization for RIS-Assisted Semantic Vehicular Edge Computing
arXiv:2603.09082v1 Announce Type: new Abstract: To support latency-sensitive Internet of Vehicles (IoV) applications amidst dynamic environments and intermittent links, this paper proposes a Reconfigurable Intelligent Surface (RIS)-aided semantic-aware Vehicle Edge Computing (VEC) framework. This approach integrates RIS to optimize wireless...
Overcoming Valid Action Suppression in Unmasked Policy Gradient Algorithms
arXiv:2603.09090v1 Announce Type: new Abstract: In reinforcement learning environments with state-dependent action validity, action masking consistently outperforms penalty-based handling of invalid actions, yet existing theory only shows that masking preserves the policy gradient theorem. We identify a distinct failure mode...
Probabilistic Hysteresis Factor Prediction for Electric Vehicle Batteries with Graphite Anodes Containing Silicon
arXiv:2603.09103v1 Announce Type: new Abstract: Batteries with silicon-graphite-based anodes, which offer higher energy density and improved charging performance, introduce pronounced voltage hysteresis, making state-of-charge (SoC) estimation particularly challenging. Existing approaches to modeling hysteresis rely on exhaustive high-fidelity tests or focus...
Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards
arXiv:2603.09117v1 Announce Type: new Abstract: Reinforcement Learning from Verifiable Rewards (RLVR) significantly enhances large language models (LLMs) reasoning but severely suffers from calibration degeneration, where models become excessively over-confident in incorrect answers. Previous studies devote to directly incorporating calibration objective...
Causally Sufficient and Necessary Feature Expansion for Class-Incremental Learning
arXiv:2603.09145v1 Announce Type: new Abstract: Current expansion-based methods for Class Incremental Learning (CIL) effectively mitigate catastrophic forgetting by freezing old features. However, such task-specific features learned from the new task may collide with the old features. From a causal perspective,...
Wrong Code, Right Structure: Learning Netlist Representations from Imperfect LLM-Generated RTL
arXiv:2603.09161v1 Announce Type: new Abstract: Learning effective netlist representations is fundamentally constrained by the scarcity of labeled datasets, as real designs are protected by Intellectual Property (IP) and costly to annotate. Existing work therefore focuses on small-scale circuits with clean...
GIAT: A Geologically-Informed Attention Transformer for Lithology Identification
arXiv:2603.09165v1 Announce Type: new Abstract: Accurate lithology identification from well logs is crucial for subsurface resource evaluation. Although Transformer-based models excel at sequence modeling, their "black-box" nature and lack of geological guidance limit their performance and trustworthiness. To overcome these...
The Radio-Frequency Transformer for Signal Separation
arXiv:2603.09201v1 Announce Type: new Abstract: We study a problem of signal separation: estimating a signal of interest (SOI) contaminated by an unknown non-Gaussian background/interference. Given the training data consisting of examples of SOI and interference, we show how to build...
Strategically Robust Multi-Agent Reinforcement Learning with Linear Function Approximation
arXiv:2603.09208v1 Announce Type: new Abstract: Provably efficient and robust equilibrium computation in general-sum Markov games remains a core challenge in multi-agent reinforcement learning. Nash equilibrium is computationally intractable in general and brittle due to equilibrium multiplicity and sensitivity to approximation...
Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control
arXiv:2603.09221v1 Announce Type: new Abstract: Associative memory has long underpinned the design of sequential models. Beyond recall, humans reason by projecting future states and selecting goal-directed actions, a capability that modern language models increasingly require but do not natively encode....
Transductive Generalization via Optimal Transport and Its Application to Graph Node Classification
arXiv:2603.09257v1 Announce Type: new Abstract: Many existing transductive bounds rely on classical complexity measures that are computationally intractable and often misaligned with empirical behavior. In this work, we establish new representation-based generalization bounds in a distribution-free transductive setting, where learned...
DendroNN: Dendrocentric Neural Networks for Energy-Efficient Classification of Event-Based Data
arXiv:2603.09274v1 Announce Type: new Abstract: Spatiotemporal information is at the core of diverse sensory processing and computational tasks. Feed-forward spiking neural networks can be used to solve these tasks while offering potential benefits in terms of energy efficiency by computing...
A Gaussian Comparison Theorem for Training Dynamics in Machine Learning
arXiv:2603.09310v1 Announce Type: new Abstract: We study training algorithms with data following a Gaussian mixture model. For a specific family of such algorithms, we present a non-asymptotic result, connecting the evolution of the model to a surrogate dynamical system, which...
Reward-Zero: Language Embedding Driven Implicit Reward Mechanisms for Reinforcement Learning
arXiv:2603.09331v1 Announce Type: new Abstract: We introduce Reward-Zero, a general-purpose implicit reward mechanism that transforms natural-language task descriptions into dense, semantically grounded progress signals for reinforcement learning (RL). Reward-Zero serves as a simple yet sophisticated universal reward function that leverages...
TA-GGAD: Testing-time Adaptive Graph Model for Generalist Graph Anomaly Detection
arXiv:2603.09349v1 Announce Type: new Abstract: A significant number of anomalous nodes in the real world, such as fake news, noncompliant users, malicious transactions, and malicious posts, severely compromises the health of the graph data ecosystem and urgently requires effective identification...
Interactive 3D visualization of surface roughness predictions in additive manufacturing: A data-driven framework
arXiv:2603.09353v1 Announce Type: new Abstract: Surface roughness in Material Extrusion Additive Manufacturing varies across a part and is difficult to anticipate during process planning because it depends on both printing parameters and local surface inclination, which governs the staircase effect....
Democratising Clinical AI through Dataset Condensation for Classical Clinical Models
arXiv:2603.09356v1 Announce Type: new Abstract: Dataset condensation (DC) learns a compact synthetic dataset that enables models to match the performance of full-data training, prioritising utility over distributional fidelity. While typically explored for computational efficiency, DC also holds promise for healthcare...
From Representation to Clusters: A Contrastive Learning Approach for Attributed Hypergraph Clustering
arXiv:2603.09370v1 Announce Type: new Abstract: Contrastive learning has demonstrated strong performance in attributed hypergraph clustering. Typically, existing methods based on contrastive learning first learn node embeddings and then apply clustering algorithms, such as k-means, to these embeddings to obtain the...
The how and why of gun control
A Second Opinion is a recurring series by Haley Proctor on the Second Amendment and constitutional litigation. Last Monday, the Supreme Court heard argument in United States v. Hemani. In […]The postThe how and why of gun controlappeared first onSCOTUSblog.
SCOTUSblog’s new podcast partners
SCOTUSblog is excited to announce the addition of podcasts Amarica’s Constitution and Divided Argument to its podcast lineup, joining Advisory Opinions. While both podcasts will maintain their editorial and creative independence, […]The postSCOTUSblog’s new podcast partnersappeared first onSCOTUSblog.
Birthright citizenship: legal takeaways of mice and men and elephants and dogs
Brothers in Law is a recurring series by brothers Akhil and Vikram Amar, with special emphasis on measuring what the Supreme Court says against what the Constitution itself says. For more content from […]The postBirthright citizenship: legal takeaways of mice...
SCOTUStoday for Tuesday, March 10
SCOTUSblog is excited to announce the addition of podcasts Amarica’s Constitution and Divided Argument to its podcast lineup, joining Advisory Opinions. In a new, jam-packed episode, the hosts of all […]The postSCOTUStoday for Tuesday, March 10appeared first onSCOTUSblog.
AI Now Co-ED Amba Kak Gives Remarks Before the UN General Assembly on AI Governance - AI Now Institute
AI-powered apps struggle with long-term retention, new report shows
AI can drive stronger early monetization for apps, but sustaining value remains the challenge, RevenueCat's latest report finds.
ChatGPT can now create interactive visuals to help you understand math and science concepts
Instead of just reading an explanation or looking at a static diagram, users can now engage directly with interactive visuals.
AgentMail raises $6M to build an email service for AI agents
AgentMail provides an API platform that lets you give AI agents their own email inboxes, with support for two-way conversations, parsing, threading, labeling, searching, and replying.