Hypothesis Class Determines Explanation: Why Accurate Models Disagree on Feature Attribution
arXiv:2603.15821v1 Announce Type: new Abstract: The assumption that prediction-equivalent models produce equivalent explanations underlies many practices in explainable AI, including model selection, auditing, and regulatory evaluation. In this work, we show that this assumption does not hold. Through a large-scale...
When Stability Fails: Hidden Failure Modes Of LLMS in Data-Constrained Scientific Decision-Making
arXiv:2603.15840v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used as decision-support tools in data-constrained scientific workflows, where correctness and validity are critical. However, evaluation practices often emphasize stability or reproducibility across repeated runs. While these properties are...
Informationally Compressive Anonymization: Non-Degrading Sensitive Input Protection for Privacy-Preserving Supervised Machine Learning
arXiv:2603.15842v1 Announce Type: new Abstract: Modern machine learning systems increasingly rely on sensitive data, creating significant privacy, security, and regulatory risks that existing privacy-preserving machine learning (ppML) techniques, such as Differential Privacy (DP) and Homomorphic Encryption (HE), address only at...
Evaluating Black-Box Vulnerabilities with Wasserstein-Constrained Data Perturbations
arXiv:2603.15867v1 Announce Type: new Abstract: The massive use of Machine Learning (ML) tools in industry comes with critical challenges, such as the lack of explainable models and the use of black-box algorithms. We address this issue by applying Optimal Transport...
Counteractive RL: Rethinking Core Principles for Efficient and Scalable Deep Reinforcement Learning
arXiv:2603.15871v1 Announce Type: new Abstract: Following the pivotal success of learning strategies to win at tasks, solely by interacting with an environment without any supervision, agents have gained the ability to make sequential decisions in complex MDPs. Yet, reinforcement learning...
Electrodermal Activity as a Unimodal Signal for Aerobic Exercise Detection in Wearable Sensors
arXiv:2603.15880v1 Announce Type: new Abstract: Electrodermal Activity (EDA) is a non-invasive physiological signal widely available in wearable devices and reflects sympathetic nervous system (SNS) activation. Prior multi-modal studies have demonstrated robust performance in distinguishing stress and exercise states when EDA...
Federated Learning for Privacy-Preserving Medical AI
arXiv:2603.15901v1 Announce Type: new Abstract: This dissertation investigates privacy-preserving federated learning for Alzheimer's disease classification using three-dimensional MRI data from the Alzheimer's Disease Neuroimaging Initiative (ADNI). Existing methodologies often suffer from unrealistic data partitioning, inadequate privacy guarantees, and insufficient benchmarking,...
Game-Theory-Assisted Reinforcement Learning for Border Defense: Early Termination based on Analytical Solutions
arXiv:2603.15907v1 Announce Type: new Abstract: Game theory provides the gold standard for analyzing adversarial engagements, offering strong optimality guarantees. However, these guarantees often become brittle when assumptions such as perfect information are violated. Reinforcement learning (RL), by contrast, is adaptive...
The Agentic Researcher: A Practical Guide to AI-Assisted Research in Mathematics and Machine Learning
arXiv:2603.15914v1 Announce Type: new Abstract: AI tools and agents are reshaping how researchers work, from proving theorems to training neural networks. Yet for many, it remains unclear how these tools fit into everyday research practice. This paper is a practical...
Auto Researching, not hyperparameter tuning: Convergence Analysis of 10,000 Experiments
arXiv:2603.15916v1 Announce Type: new Abstract: When LLM agents autonomously design ML experiments, do they perform genuine architecture search -- or do they default to hyperparameter tuning within a narrow region of the design space? We answer this question by analyzing...
Generative Inverse Design with Abstention via Diagonal Flow Matching
arXiv:2603.15925v1 Announce Type: new Abstract: Inverse design aims to find design parameters $x$ achieving target performance $y^*$. Generative approaches learn bidirectional mappings between designs and labels, enabling diverse solution sampling. However, standard conditional flow matching (CFM), when adapted to inverse...
Evaluating Causal Discovery Algorithms for Path-Specific Fairness and Utility in Healthcare
arXiv:2603.15926v1 Announce Type: new Abstract: Causal discovery in health data faces evaluation challenges when ground truth is unknown. We address this by collaborating with experts to construct proxy ground-truth graphs, establishing benchmarks for synthetic Alzheimer's disease and heart failure clinical...
Discovery of interaction and diffusion kernels in particle-to-mean-field multi-agent systems
arXiv:2603.15927v1 Announce Type: new Abstract: We propose a data-driven framework to learn interaction kernels in stochastic multi-agent systems. Our approach aims at identifying the functional form of nonlocal interaction and diffusion terms directly from trajectory data, without any a priori...
Data-Local Autonomous LLM-Guided Neural Architecture Search for Multiclass Multimodal Time-Series Classification
arXiv:2603.15939v1 Announce Type: new Abstract: Applying machine learning to sensitive time-series data is often bottlenecked by the iteration loop: Performance depends strongly on preprocessing and architecture, yet training often has to run on-premise under strict data-local constraints. This is a...
GASP: Guided Asymmetric Self-Play For Coding LLMs
arXiv:2603.15957v1 Announce Type: new Abstract: Asymmetric self-play has emerged as a promising paradigm for post-training large language models, where a teacher continually generates questions for a student to solve at the edge of the student's learnability. Although these methods promise...
Deriving Hyperparameter Scaling Laws via Modern Optimization Theory
arXiv:2603.15958v1 Announce Type: new Abstract: Hyperparameter transfer has become an important component of modern large-scale training recipes. Existing methods, such as muP, primarily focus on transfer between model sizes, with transfer across batch sizes and training horizons often relying on...
Determinism in the Undetermined: Deterministic Output in Charge-Conserving Continuous-Time Neuromorphic Systems with Temporal Stochasticity
arXiv:2603.15987v1 Announce Type: new Abstract: Achieving deterministic computation results in asynchronous neuromorphic systems remains a fundamental challenge due to the inherent temporal stochasticity of continuous-time hardware. To address this, we develop a unified continuous-time framework for spiking neural networks (SNNs)...
The Importance of Being Smoothly Calibrated
arXiv:2603.16015v1 Announce Type: new Abstract: Recent work has highlighted the centrality of smooth calibration [Kakade and Foster, 2008] as a robust measure of calibration error. We generalize, unify, and extend previous results on smooth calibration, both as a robust calibration...
Residual Stream Duality in Modern Transformer Architectures
arXiv:2603.16039v1 Announce Type: new Abstract: Recent work has made clear that the residual pathway is not mere optimization plumbing; it is part of the model's representational machinery. We agree, but argue that the cleanest way to organize this design space...
Collaborative Temporal Feature Generation via Critic-Free Reinforcement Learning for Cross-User Sensor-Based Activity Recognition
arXiv:2603.16043v1 Announce Type: new Abstract: Human Activity Recognition using wearable inertial sensors is foundational to healthcare monitoring, fitness analytics, and context-aware computing, yet its deployment is hindered by cross-user variability arising from heterogeneous physiological traits, motor habits, and sensor placements....
Adaptive regularization parameter selection for high-dimensional inverse problems: A Bayesian approach with Tucker low-rank constraints
arXiv:2603.16066v1 Announce Type: new Abstract: This paper introduces a novel variational Bayesian method that integrates Tucker decomposition for efficient high-dimensional inverse problem solving. The method reduces computational complexity by transforming variational inference from a high-dimensional space to a lower-dimensional core...
MDM-Prime-v2: Binary Encoding and Index Shuffling Enable Compute-optimal Scaling of Diffusion Language Models
arXiv:2603.16077v1 Announce Type: new Abstract: Masked diffusion models (MDM) exhibit superior generalization when learned using a Partial masking scheme (Prime). This approach converts tokens into sub-tokens and models the diffusion process at the sub-token level. We identify two limitations of...
A Depth-Aware Comparative Study of Euclidean and Hyperbolic Graph Neural Networks on Bitcoin Transaction Systems
arXiv:2603.16080v1 Announce Type: new Abstract: Bitcoin transaction networks are large scale socio- technical systems in which activities are represented through multi-hop interaction patterns. Graph Neural Networks(GNNs) have become a widely adopted tool for analyzing such systems, supporting tasks such as...
Functorial Neural Architectures from Higher Inductive Types
arXiv:2603.16123v1 Announce Type: new Abstract: Neural networks systematically fail at compositional generalization -- producing correct outputs for novel combinations of known parts. We show that this failure is architectural: compositional generalization is equivalent to functoriality of the decoder, and this...
Noisy Data is Destructive to Reinforcement Learning with Verifiable Rewards
arXiv:2603.16140v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) has driven recent capability advances of large language models across various domains. Recent studies suggest that improved RLVR algorithms allow models to learn effectively from incorrect annotations, achieving performance...
The Supreme Court of Canada
Welcome to SCOTUSblog’s recurring series in which we interview experts on different supreme courts around the world and how they compare to our own. For our debut column, we covered […]The postThe Supreme Court of Canadaappeared first onSCOTUSblog.
The biggest names on the briefs
Empirical SCOTUS is a recurring series by Adam Feldman that looks at Supreme Court data, primarily in the form of opinions and oral arguments, to provide insights into the justices’ decision making and […]The postThe biggest names on the briefsappeared...
SCOTUStoday for Tuesday, March 17
Happy St. Patrick’s Day! We recommend celebrating by reading about Supreme Court justices of Irish descent.The postSCOTUStoday for Tuesday, March 17appeared first onSCOTUSblog.
Apple can delist apps "with or without cause," judge says in loss for Musi app
Judge tosses Musi case against Apple, sanctions lawyers for "mak[ing] up facts."
Trump's plan to shut down weather and climate center triggers lawsuit
Suit: The National Center for Atmospheric Research is to be terminated for no rational reason.