Legal-DC: Benchmarking Retrieval-Augmented Generation for Legal Documents
arXiv:2603.11772v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) has emerged as a promising technology for legal document consultation, yet its application in Chinese legal scenarios faces two key limitations: existing benchmarks lack specialized support for joint retriever-generator evaluation, and mainstream...
Just Use XML: Revisiting Joint Translation and Label Projection
arXiv:2603.12021v1 Announce Type: new Abstract: Label projection is an effective technique for cross-lingual transfer, extending span-annotated datasets from a high-resource language to low-resource ones. Most approaches perform label projection as a separate step after machine translation, and prior work that...
Task-Conditioned Routing Signatures in Sparse Mixture-of-Experts Transformers
arXiv:2603.11114v1 Announce Type: new Abstract: Sparse Mixture-of-Experts (MoE) architectures enable efficient scaling of large language models through conditional computation, yet the routing mechanisms responsible for expert selection remain poorly understood. In this work, we introduce routing signatures, a vector representation...
Higher-Order Modular Attention: Fusing Pairwise and Triadic Interactions for Protein Sequences
arXiv:2603.11133v1 Announce Type: new Abstract: Transformer self-attention computes pairwise token interactions, yet protein sequence to phenotype relationships often involve cooperative dependencies among three or more residues that dot product attention does not capture explicitly. We introduce Higher-Order Modular Attention, HOMA,...
Heavy-Tailed Principle Component Analysis
arXiv:2603.11308v1 Announce Type: new Abstract: Principal Component Analysis (PCA) is a cornerstone of dimensionality reduction, yet its classical formulation relies critically on second-order moments and is therefore fragile in the presence of heavy-tailed data and impulsive noise. While numerous robust...
When presidents attack the Supreme Court
During a roundtable at the White House on Friday, March 6, President Donald Trump returned to what has become a familiar refrain in the weeks since the Supreme Court struck […]The postWhen presidents attack the Supreme Courtappeared first onSCOTUSblog.
SCOTUStoday for Thursday, March 12
On this day in 1804, the House of Representatives voted to impeach Justice Samuel Chase, who had been accused of abusing his power by refusing to dismiss biased jurors and […]The postSCOTUStoday for Thursday, March 12appeared first onSCOTUSblog.
HEAL: Hindsight Entropy-Assisted Learning for Reasoning Distillation
arXiv:2603.10359v1 Announce Type: new Abstract: Distilling reasoning capabilities from Large Reasoning Models (LRMs) into smaller models is typically constrained by the limitation of rejection sampling. Standard methods treat the teacher as a static filter, discarding complex "corner-case" problems where the...
Quantifying Hallucinations in Language Language Models on Medical Textbooks
arXiv:2603.09986v1 Announce Type: cross Abstract: Hallucinations, the tendency for large language models to provide responses with factually incorrect and unsupported claims, is a serious problem within natural language processing for which we do not yet have an effective solution to...
A Governance and Evaluation Framework for Deterministic, Rule-Based Clinical Decision Support in Empiric Antibiotic Prescribing
arXiv:2603.10027v1 Announce Type: cross Abstract: Empiric antibiotic prescribing in high-risk clinical contexts often requires decision making under conditions of incomplete information, where inappropriate coverage or unjustified escalation may compromise safety and antimicrobial stewardship. While clinical decision-support systems have been proposed...
OpenClaw-RL: Train Any Agent Simply by Talking
arXiv:2603.10165v1 Announce Type: new Abstract: Every agent interaction generates a next-state signal, namely the user reply, tool output, terminal or GUI state change that follows each action, yet no existing agentic RL system recovers it as a live, online learning...
Dynamic Knowledge Fusion for Multi-Domain Dialogue State Tracking
arXiv:2603.10367v1 Announce Type: new Abstract: The performance of task-oriented dialogue models is strongly tied to how well they track dialogue states, which records and updates user information across multi-turn interactions. However, current multi-domain DST encounters two key challenges: the difficulty...
Revisiting Sharpness-Aware Minimization: A More Faithful and Effective Implementation
arXiv:2603.10048v1 Announce Type: new Abstract: Sharpness-Aware Minimization (SAM) enhances generalization by minimizing the maximum training loss within a predefined neighborhood around the parameters. However, its practical implementation approximates this as gradient ascent(s) followed by applying the gradient at the ascent...
Dissecting Chronos: Sparse Autoencoders Reveal Causal Feature Hierarchies in Time Series Foundation Models
arXiv:2603.10071v1 Announce Type: new Abstract: Time series foundation models (TSFMs) are increasingly deployed in high-stakes domains, yet their internal representations remain opaque. We present the first application of sparse autoencoders (SAEs) to a TSFM, training TopK SAEs on activations of...
Large Spikes in Stochastic Gradient Descent: A Large-Deviations View
arXiv:2603.10079v1 Announce Type: new Abstract: We analyse SGD training of a shallow, fully connected network in the NTK scaling and provide a quantitative theory of the catapult phase. We identify an explicit criterion separating two behaviours: When an explicit function...
Equivariant Asynchronous Diffusion: An Adaptive Denoising Schedule for Accelerated Molecular Conformation Generation
arXiv:2603.10093v1 Announce Type: new Abstract: Recent 3D molecular generation methods primarily use asynchronous auto-regressive or synchronous diffusion models. While auto-regressive models build molecules sequentially, they're limited by a short horizon and a discrepancy between training and inference. Conversely, synchronous diffusion...
GSVD for Geometry-Grounded Dataset Comparison: An Alignment Angle Is All You Need
arXiv:2603.10283v1 Announce Type: new Abstract: Geometry-grounded learning asks models to respect structure in the problem domain rather than treating observations as arbitrary vectors. Motivated by this view, we revisit a classical but underused primitive for comparing datasets: linear relations between...
How to make the most of your masked language model for protein engineering
arXiv:2603.10302v1 Announce Type: new Abstract: A plethora of protein language models have been released in recent years. Yet comparatively little work has addressed how to best sample from them to optimize desired biological properties. We fill this gap by proposing...
Federated Active Learning Under Extreme Non-IID and Global Class Imbalance
arXiv:2603.10341v1 Announce Type: new Abstract: Federated active learning (FAL) seeks to reduce annotation cost under privacy constraints, yet its effectiveness degrades in realistic settings with severe global class imbalance and highly heterogeneous clients. We conduct a systematic study of query-model...
Trump administration urges Supreme Court to allow it to revoke protected status for Haitian nationals
The Trump administration on Wednesday asked the Supreme Court to pause a ruling by a federal judge in Washington, D.C., that barred the government from ending a program that allows […]The postTrump administration urges Supreme Court to allow it to...
The First Amendment’s application to public university students: an explainer
Free speech on university campuses is a perennially hot topic, perhaps most recently reflected in protests about the Israeli-Palestinian conflict at places like Ball State University, Harvard, and Columbia. This […]The postThe First Amendment’s application to public university students: an...
SCOTUStoday for Wednesday, March 11
You’ve likely heard of AI bots being used improperly by lawyers, but what about lawsuits over AI bots practicing law without a license? Reuters reported on one such case last […]The postSCOTUStoday for Wednesday, March 11appeared first onSCOTUSblog.
FCC chair blasts Amazon after it criticizes SpaceX megaconstellation
Will it really take "centuries" for SpaceX to deploy its megaconstellation?
What crackdown? Trump's EPA enforcement claims don't pass sniff test.
75% of the criminal cases closed last fiscal year originated before Trump took office.
Ford’s new AI assistant will help fleet owners know if seatbelts are being used
Ford Pro AI debuted at Work Truck Week in Indianapolis and is now available to all of its U.S.-based Pro telematics subscribers.
Liberty of Conscience, Political Process Theory, and Founding-Era Free Exercise
Religious freedom claimants have achieved tremendous success before the Supreme Court in recent years. Yet free exercise jurisprudence has bounced between skepticism and embrace...The postLiberty of Conscience, Political Process Theory, and Founding-Era Free Exerciseappeared first onHarvard Law Review.
Sun Valley Orchards, LLCv. United States Department of Labor
In SEC v. Jarkesy, the Supreme Court failed to fully clarify the “unquestionably muddy” relationship between Article III and the Seventh Amendment. Yet it...The post<em>Sun Valley Orchards, LLC<br>v. United States Department of Labor</em>appeared first onHarvard Law Review.
United States v. Johnson
Drug detection dogs are critical tools in the fight against drug trafficking. However, law enforcement canines are imperfect: They sometimes incorrectly alert when performing...The post<em>United States v. Johnson</em>appeared first onHarvard Law Review.
Time, Identity and Consciousness in Language Model Agents
arXiv:2603.09043v1 Announce Type: new Abstract: Machine consciousness evaluations mostly see behavior. For language model agents that behavior is language and tool use. That lets an agent say the right things about itself even when the constraints that should make those...