Preventing Curriculum Collapse in Self-Evolving Reasoning Systems
arXiv:2603.13309v1 Announce Type: new Abstract: Self-evolving reasoning frameworks let LLMs improve their reasoning capabilities by iteratively generating and solving problems without external supervision, using verifiable rewards. Ideally, such systems are expected to explore a diverse problem space and propose new...
Neural Approximation and Its Applications
arXiv:2603.13311v1 Announce Type: new Abstract: Multivariate function approximation is a fundamental problem in machine learning. Classic multivariate function approximations rely on hand-crafted basis functions (e.g., polynomial basis and Fourier basis), which limits their approximation ability and data adaptation ability, resulting...
Linear Predictability of Attention Heads in Large Language Models
arXiv:2603.13314v1 Announce Type: new Abstract: Large language model (LLM) inference is increasingly bottlenecked by the Key-Value (KV) cache, yet the fine-grained structure of attention-head activations remains poorly understood. We show that pretrained Transformers exhibit a pervasive inter-head linear structure: for...
Evaluating Large Language Models for Gait Classification Using Text-Encoded Kinematic Waveforms
arXiv:2603.13317v1 Announce Type: new Abstract: Background: Machine learning (ML) enhances gait analysis but often lacks the level of interpretability desired for clinical adoption. Large Language Models (LLMs) may offer explanatory capabilities and confidence-aware outputs when applied to structured kinematic data....
LightningRL: Breaking the Accuracy-Parallelism Trade-off of Block-wise dLLMs via Reinforcement Learning
arXiv:2603.13319v1 Announce Type: new Abstract: Diffusion Large Language Models (dLLMs) have emerged as a promising paradigm for parallel token generation, with block-wise variants garnering significant research interest. Despite their potential, existing dLLMs typically suffer from a rigid accuracy-parallelism trade-off: increasing...
Modular Neural Computer
arXiv:2603.13323v1 Announce Type: new Abstract: This paper introduces the Modular Neural Computer (MNC), a memory-augmented neural architecture for exact algorithmic computation on variable-length inputs. The model combines an external associative memory of scalar cells, explicit read and write heads, a...
The Challenge of Out-Of-Distribution Detection in Motor Imagery BCIs
arXiv:2603.13324v1 Announce Type: new Abstract: Machine Learning classifiers used in Brain-Computer Interfaces make classifications based on the distribution of data they were trained on. When they need to make inferences on samples that fall outside of this distribution, they can...
RBF-Solver: A Multistep Sampler for Diffusion Probabilistic Models via Radial Basis Functions
arXiv:2603.13330v1 Announce Type: new Abstract: Diffusion probabilistic models (DPMs) are widely adopted for their outstanding generative fidelity, yet their sampling is computationally demanding. Polynomial-based multistep samplers mitigate this cost by accelerating inference; however, despite their theoretical accuracy guarantees, they generate...
MS2MetGAN: Latent-space adversarial training for metabolite-spectrum matching in MS/MS database search
arXiv:2603.13342v1 Announce Type: new Abstract: Database search is a widely used approach for identifying metabolites from tandem mass spectra (MS/MS). In this strategy, an experimental spectrum is matched against a user-specified database of candidate metabolites, and candidates are ranked such...
PolyGLU: State-Conditional Activation Routing in Transformer Feed-Forward Networks
arXiv:2603.13347v1 Announce Type: new Abstract: Biological neural systems employ diverse neurotransmitters -- glutamate, GABA, dopamine, acetylcholine -- to implement distinct signal-processing modalities within shared neural circuits. In contrast, modern transformers apply a single fixed activation function across all feed-forward neurons....
Thermal Robustness of Retrieval in Dense Associative Memories: LSE vs LSR Kernels
arXiv:2603.13350v1 Announce Type: new Abstract: Understanding whether retrieval in dense associative memories survives thermal noise is essential for bridging zero-temperature capacity proofs with the finite-temperature conditions of practical inference and biological computation. We use Monte Carlo simulations to map the...
A Hierarchical End-of-Turn Model with Primary Speaker Segmentation for Real-Time Conversational AI
arXiv:2603.13379v1 Announce Type: new Abstract: We present a real-time front-end for voice-based conversational AI to enable natural turn-taking in two-speaker scenarios by combining primary speaker segmentation with hierarchical End-of-Turn (EOT) detection. To operate robustly in multi-speaker environments, the system continuously...
Justices will hear argument on Trump administration’s removal of protected status for Syrian and Haitian nationals
The Supreme Court announced on Monday afternoon that it will hear oral argument on whether the Trump administration can end a program that allows several thousand Syrians and approximately 350,000 […]The postJustices will hear argument on Trump administration’s removal of...
Haitian nationals ask court to deny Trump administration’s request to remove their protected status
A group of Haitian nationals urged the Supreme Court on Monday to leave in place a ruling by a federal judge in Washington, D.C., that allows them to stay in […]The postHaitian nationals ask court to deny Trump administration’s request...
Birthright citizenship: a response to Pete Patterson
Brothers in Law is a recurring series by brothers Akhil and Vikram Amar, with special emphasis on measuring what the Supreme Court says against what the Constitution itself says. For more content from […]The postBirthright citizenship: a response to Pete...
A 95th birthday tribute to legendary SCOTUSblog reporter Lyle Denniston
The inimitable Lyle Denniston, who served as the primary reporter for SCOTUSblog from 2004 until 2016, celebrates his 95th birthday today. Lyle began his reporting career in 1948 at the […]The postA 95th birthday tribute to legendary SCOTUSblog reporter Lyle...
SCOTUStoday: Trump v. the Fed
Six years ago today, the court announced that it was postponing its March argument session in response to the COVID-19 pandemic. The press release noted that its “postponement of argument […]The postSCOTUStoday: Trump v. the Fedappeared first onSCOTUSblog.
OpenAI’s own mental health experts unanimously opposed “naughty” ChatGPT launch
OpenAI draws a line between AI “smut” and porn. Experts fear it’s all unhealthy.
Memories AI is building the visual memory layer for wearables and robotics
Memories.ai is building a large visual memory model that can index and retrieve video-recorded memories for physical AI.
From Garbage to Gold: A Data-Architectural Theory of Predictive Robustness
arXiv:2603.12288v1 Announce Type: cross Abstract: Tabular machine learning presents a paradox: modern models achieve state-of-the-art performance using high-dimensional (high-D), collinear, error-prone data, defying the "Garbage In, Garbage Out" mantra. To help resolve this, we synthesize principles from Information Theory, Latent...
Aligning Language Models from User Interactions
arXiv:2603.12273v1 Announce Type: cross Abstract: Multi-turn user interactions are among the most abundant data produced by language models, yet we lack effective methods to learn from them. While typically discarded, these interactions often contain useful information: follow-up user messages may...
VQQA: An Agentic Approach for Video Evaluation and Quality Improvement
arXiv:2603.12310v1 Announce Type: cross Abstract: Despite rapid advancements in video generation models, aligning their outputs with complex user intent remains challenging. Existing test-time optimization methods are typically either computationally expensive or require white-box access to model internals. To address this,...
Thermodynamics of Reinforcement Learning Curricula
arXiv:2603.12324v1 Announce Type: cross Abstract: Connections between statistical mechanics and machine learning have repeatedly proven fruitful, providing insight into optimization, generalization, and representation learning. In this work, we follow this tradition by leveraging results from non-equilibrium thermodynamics to formalize curriculum...
On Using Machine Learning to Early Detect Catastrophic Failures in Marine Diesel Engines
arXiv:2603.12733v1 Announce Type: new Abstract: Catastrophic failures of marine engines imply severe loss of functionality and destroy or damage the systems irreversibly. Being sudden and often unpredictable events, they pose a severe threat to navigation, crew, and passengers. The abrupt...
Beyond Final Answers: CRYSTAL Benchmark for Transparent Multimodal Reasoning Evaluation
arXiv:2603.13099v1 Announce Type: new Abstract: We introduce **CRYSTAL** (*__C__lear __R__easoning via __Y__ielded __S__teps, __T__raceability and __L__ogic*), a diagnostic benchmark with 6,372 instances that evaluates multimodal reasoning through verifiable intermediate steps. We propose two complementary metrics: *Match F1*, which scores step-level...
HCP-DCNet: A Hierarchical Causal Primitive Dynamic Composition Network for Self-Improving Causal Understanding
arXiv:2603.12305v1 Announce Type: cross Abstract: The ability to understand and reason about cause and effect -- encompassing interventions, counterfactuals, and underlying mechanisms -- is a cornerstone of robust artificial intelligence. While deep learning excels at pattern recognition, it fundamentally lacks...
Structured Distillation for Personalized Agent Memory: 11x Token Reduction with Retrieval Preservation
arXiv:2603.13017v1 Announce Type: new Abstract: Long conversations with an AI agent create a simple problem for one user: the history is useful, but carrying it verbatim is expensive. We study personalized agent memory: one user's conversation history with an agent,...
Context-Enriched Natural Language Descriptions of Vessel Trajectories
arXiv:2603.12287v1 Announce Type: new Abstract: We address the problem of transforming raw vessel trajectory data collected from AIS into structured and semantically enriched representations interpretable by humans and directly usable by machine reasoning systems. We propose a context-aware trajectory abstraction...
Detecting Miscitation on the Scholarly Web through LLM-Augmented Text-Rich Graph Learning
arXiv:2603.12290v1 Announce Type: cross Abstract: Scholarly web is a vast network of knowledge connected by citations. However, this system is increasingly compromised by miscitation, where references do not support or even contradict the claims they are cited for. Current miscitation...
ODRL Policy Comparison Through Normalisation
arXiv:2603.12926v1 Announce Type: new Abstract: The ODRL language has become the standard for representing policies and regulations for digital rights. However its complexity is a barrier to its usage, which has caused many related theoretical and practical works to focus...