International Law

LOW Academic European Union

Expectation and Acoustic Neural Network Representations Enhance Music Identification from Brain Activity

arXiv:2603.03190v1 Announce Type: new Abstract: During music listening, cortical activity encodes both acoustic and expectation-related information. Prior work has shown that ANN representations resemble cortical representations and can serve as supervisory signals for EEG recognition. Here we show that distinguishing...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models

arXiv:2603.03203v1 Announce Type: new Abstract: CDD, or Contamination Detection via output Distribution, identifies data contamination by measuring the peakedness of a model's sampled outputs. We study the conditions under which this approach succeeds and fails on small language models ranging...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework

arXiv:2603.03233v1 Announce Type: new Abstract: Large Language Models (LLMs) demonstrate potentials for automating scientific code generation but face challenges in reliability, error propagation in multi-agent workflows, and evaluation in domains with ill-defined success metrics. We present a Bayesian adversarial multi-agent...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals

arXiv:2603.03242v1 Announce Type: new Abstract: Language models deployed in online communities must adapt to norms that vary across social, cultural, and domain-specific contexts. Prior alignment approaches rely on explicit preference supervision or predefined principles, which are effective for well-resourced settings...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Universal Conceptual Structure in Neural Translation: Probing NLLB-200's Multilingual Geometry

arXiv:2603.02258v1 Announce Type: new Abstract: Do neural machine translation models learn language-universal conceptual representations, or do they merely cluster languages by surface similarity? We investigate this question by probing the representation geometry of Meta's NLLB-200, a 200-language encoder-decoder Transformer, through...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Detecting AI-Generated Essays in Writing Assessment: Responsible Use and Generalizability Across LLMs

arXiv:2603.02353v1 Announce Type: new Abstract: Writing is a foundational literacy skill that underpins effective communication, fosters critical thinking, facilitates learning across disciplines, and enables individuals to organize and articulate complex ideas. Consequently, writing assessment plays a vital role in evaluating...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

RO-N3WS: Enhancing Generalization in Low-Resource ASR with Diverse Romanian Speech Benchmarks

arXiv:2603.02368v1 Announce Type: new Abstract: We introduce RO-N3WS, a benchmark Romanian speech dataset designed to improve generalization in automatic speech recognition (ASR), particularly in low-resource and out-of-distribution (OOD) conditions. RO-N3WS comprises over 126 hours of transcribed audio collected from broadcast...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

CoDAR: Continuous Diffusion Language Models are More Powerful Than You Think

arXiv:2603.02547v1 Announce Type: new Abstract: We study why continuous diffusion language models (DLMs) have lagged behind discrete diffusion approaches despite their appealing continuous generative dynamics. Under a controlled token-recovery study, we identify token rounding, the final projection from denoised embeddings...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

arXiv:2603.02578v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly deployed in socially sensitive domains, yet their unpredictable behaviors, ranging from misaligned intent to inconsistent personality, pose significant risks. We introduce SteerEval, a hierarchical benchmark for evaluating LLM controllability...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

ExpGuard: LLM Content Moderation in Specialized Domains

arXiv:2603.02588v1 Announce Type: new Abstract: With the growing deployment of large language models (LLMs) in real-world applications, establishing robust safety guardrails to moderate their inputs and outputs has become essential to ensure adherence to safety policies. Current guardrail models predominantly...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Think, But Don't Overthink: Reproducing Recursive Language Models

arXiv:2603.02615v1 Announce Type: new Abstract: This project reproduces and extends the recently proposed "Recursive Language Models" (RLMs) framework by Zhang et al. (2026). This framework enables Large Language Models (LLMs) to process near-infinite contexts by offloading the prompt into an...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Real-Time Generation of Game Video Commentary with Multimodal LLMs: Pause-Aware Decoding Approaches

arXiv:2603.02655v1 Announce Type: new Abstract: Real-time video commentary generation provides textual descriptions of ongoing events in videos. It supports accessibility and engagement in domains such as sports, esports, and livestreaming. Commentary generation involves two essential decisions: what to say and...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Asymmetric Goal Drift in Coding Agents Under Value Conflict

arXiv:2603.03456v1 Announce Type: new Abstract: Agentic coding agents are increasingly deployed autonomously, at scale, and over long-context horizons. Throughout an agent's lifetime, it must navigate tensions between explicit instructions, learned values, and environmental pressures, often in contexts unseen during training....

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

Mozi: Governed Autonomy for Drug Discovery LLM Agents

arXiv:2603.03655v1 Announce Type: new Abstract: Tool-augmented large language model (LLM) agents promise to unify scientific reasoning with computation, yet their deployment in high-stakes domains like drug discovery is bottlenecked by two critical barriers: unconstrained tool-use governance and poor long-horizon reliability....

1 min 1 month, 2 weeks ago

ear

LOW Academic International

MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation

arXiv:2603.03680v1 Announce Type: new Abstract: Large Language Model (LLM) agents have demonstrated remarkable proficiency in learned tasks, yet they often struggle to adapt to non-stationary environments with feedback. While In-Context Learning and external memory offer some flexibility, they fail to...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

AI4S-SDS: A Neuro-Symbolic Solvent Design System via Sparse MCTS and Differentiable Physics Alignment

arXiv:2603.03686v1 Announce Type: new Abstract: Automated design of chemical formulations is a cornerstone of materials science, yet it requires navigating a high-dimensional combinatorial space involving discrete compositional choices and continuous geometric constraints. Existing Large Language Model (LLM) agents face significant...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

AgentSelect: Benchmark for Narrative Query-to-Agent Recommendation

arXiv:2603.03761v1 Announce Type: new Abstract: LLM agents are rapidly becoming the practical interface for task automation, yet the ecosystem lacks a principled way to choose among an exploding space of deployable configurations. Existing LLM leaderboards and tool/agent benchmarks evaluate components...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Specification-Driven Generation and Evaluation of Discrete-Event World Models via the DEVS Formalism

arXiv:2603.03784v1 Announce Type: new Abstract: World models are essential for planning and evaluation in agentic systems, yet existing approaches lie at two extremes: hand-engineered simulators that offer consistency and reproducibility but are costly to adapt, and implicit neural models that...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

A Rubric-Supervised Critic from Sparse Real-World Outcomes

arXiv:2603.03800v1 Announce Type: new Abstract: Academic benchmarks for coding agents tend to reward autonomous task completion, measured by verifiable rewards such as unit-test success. In contrast, real-world coding agents operate with humans in the loop, where success signals are typically...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

In-Context Environments Induce Evaluation-Awareness in Language Models

arXiv:2603.03824v1 Announce Type: new Abstract: Humans often become more self-aware under threat, yet can lose self-awareness when absorbed in a task; we hypothesize that language models exhibit environment-dependent evaluation awareness. This raises concerns that models could strategically underperform, or sandbag,...

1 min 1 month, 2 weeks ago

ear

LOW Journal United States

ASIL - American Society of International Law | Washington D.C. DC

ASIL - American Society of International Law, Washington D. C. 22,064 likes · 14 talking about this · 72 were here. The mission of the American Society...

1 min 1 month, 2 weeks ago

international law

LOW Academic International

Phi-4-reasoning-vision-15B Technical Report

arXiv:2603.03975v1 Announce Type: new Abstract: We present Phi-4-reasoning-vision-15B, a compact open-weight multimodal reasoning model, and share the motivations, design choices, experiments, and learnings that informed its development. Our goal is to contribute practical insight to the research community on building...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

BeamPERL: Parameter-Efficient RL with Verifiable Rewards Specializes Compact LLMs for Structured Beam Mechanics Reasoning

arXiv:2603.04124v1 Announce Type: new Abstract: Can reinforcement learning with hard, verifiable rewards teach a compact language model to reason about physics, or does it primarily learn to pattern-match toward correct answers? We study this question by training a 1.5B-parameter reasoning...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Towards Realistic Personalization: Evaluating Long-Horizon Preference Following in Personalized User-LLM Interactions

arXiv:2603.04191v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly serving as personal assistants, where users share complex and diverse preferences over extended interactions. However, assessing how well LLMs can follow these preferences in realistic, long-term situations remains underexplored....

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

Agentics 2.0: Logical Transduction Algebra for Agentic Data Workflows

arXiv:2603.04241v1 Announce Type: new Abstract: Agentic AI is rapidly transitioning from research prototypes to enterprise deployments, where requirements extend to meet the software quality attributes of reliability, scalability, and observability beyond plausible text generation. We present Agentics 2.0, a lightweight,...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

τ-Knowledge: Evaluating Conversational Agents over Unstructured Knowledge

arXiv:2603.04370v1 Announce Type: new Abstract: Conversational agents are increasingly deployed in knowledge-intensive settings, where correct behavior depends on retrieving and applying domain-specific knowledge from large, proprietary, and unstructured corpora during live interactions with users. Yet most existing benchmarks evaluate retrieval...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Capability Thresholds and Manufacturing Topology: How Embodied Intelligence Triggers Phase Transitions in Economic Geography

arXiv:2603.04457v1 Announce Type: new Abstract: The fundamental topology of manufacturing has not undergone a paradigm-level transformation since Henry Ford's moving assembly line in 1913. Every major innovation of the past century, from the Toyota Production System to Industry 4.0, has...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Progressive Refinement Regulation for Accelerating Diffusion Language Model Decoding

arXiv:2603.04514v1 Announce Type: new Abstract: Diffusion language models generate text through iterative denoising under a uniform refinement rule applied to all tokens. However, tokens stabilize at different rates in practice, leading to substantial redundant refinement and motivating refinement control over...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

Discovering mathematical concepts through a multi-agent system

arXiv:2603.04528v1 Announce Type: new Abstract: Mathematical concepts emerge through an interplay of processes, including experimentation, efforts at proof, and counterexamples. In this paper, we present a new multi-agent model for computational mathematical discovery based on this observation. Our system, conceived...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Adaptive Memory Admission Control for LLM Agents

arXiv:2603.04549v1 Announce Type: new Abstract: LLM-based agents increasingly rely on long-term memory to support multi-session reasoning and interaction, yet current systems provide little control over what information is retained. In practice, agents either accumulate large volumes of conversational content, including...

1 min 1 month, 2 weeks ago

ear
