International Law

LOW Academic International

SAE as a Crystal Ball: Interpretable Features Predict Cross-domain Transferability of LLMs without Training

arXiv:2603.02908v1 Announce Type: new Abstract: In recent years, pre-trained large language models have achieved remarkable success across diverse tasks. Besides the pivotal role of self-supervised pre-training, their effectiveness in downstream applications also depends critically on the post-training process, which adapts...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

ShipTraj-R1: Reinforcing Ship Trajectory Prediction in Large Language Models via Group Relative Policy Optimization

arXiv:2603.02939v1 Announce Type: new Abstract: Recent advancements in reinforcement fine-tuning have significantly improved the reasoning ability of large language models (LLMs). In particular, methods such as group relative policy optimization (GRPO) have demonstrated strong capabilities across various fields. However, applying...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Architecting Trust in Artificial Epistemic Agents

arXiv:2603.02960v1 Announce Type: new Abstract: Large language models increasingly function as epistemic agents -- entities that can 1) autonomously pursue epistemic goals and 2) actively shape our shared knowledge environment. They curate the information we receive, often supplanting traditional search-based...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

OrchMAS: Orchestrated Reasoning with Multi Collaborative Heterogeneous Scientific Expert Structured Agents

arXiv:2603.03005v1 Announce Type: new Abstract: Multi-agent large language model frameworks are promising for complex multi step reasoning, yet existing systems remain weak for scientific and knowledge intensive domains due to static prompts and agent roles, rigid workflows, and homogeneous model...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

REGAL: A Registry-Driven Architecture for Deterministic Grounding of Agentic AI in Enterprise Telemetry

arXiv:2603.03018v1 Announce Type: new Abstract: Enterprise engineering organizations produce high-volume, heterogeneous telemetry from version control systems, CI/CD pipelines, issue trackers, and observability platforms. Large Language Models (LLMs) enable new forms of agentic automation, but grounding such agents on private telemetry...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

TikZilla: Scaling Text-to-TikZ with High-Quality Data and Reinforcement Learning

arXiv:2603.03072v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used to assist scientists across diverse workflows. A key challenge is generating high-quality figures from textual descriptions, often represented as TikZ programs that can be rendered as scientific images....

1 min 1 month, 2 weeks ago

ear

LOW Academic International

RAPO: Expanding Exploration for LLM Agents via Retrieval-Augmented Policy Optimization

arXiv:2603.03078v1 Announce Type: new Abstract: Agentic Reinforcement Learning (Agentic RL) has shown remarkable potential in large language model-based (LLM) agents. These works can empower LLM agents to tackle complex tasks via multi-step, tool-integrated reasoning. However, an inherent limitation of existing...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

Odin: Multi-Signal Graph Intelligence for Autonomous Discovery in Knowledge Graphs

arXiv:2603.03097v1 Announce Type: new Abstract: We present Odin, the first production-deployed graph intelligence engine for autonomous discovery of meaningful patterns in knowledge graphs without prior specification. Unlike retrieval-based systems that answer predefined queries, Odin guides exploration through the COMPASS (Composite...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

FEAST: Retrieval-Augmented Multi-Hierarchical Food Classification for the FoodEx2 System

arXiv:2603.03176v1 Announce Type: new Abstract: Hierarchical text classification (HTC) and extreme multi-label classification (XML) tasks face compounded challenges from complex label interdependencies, data sparsity, and extreme output dimensions. These challenges are exemplified in the European Food Safety Authority's FoodEx2 system-a...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

Neuro-Symbolic Artificial Intelligence: A Task-Directed Survey in the Black-Box Models Era

arXiv:2603.03177v1 Announce Type: new Abstract: The integration of symbolic computing with neural networks has intrigued researchers since the first theorizations of Artificial intelligence (AI). The ability of Neuro-Symbolic (NeSy) methods to infer or exploit behavioral schema has been widely considered...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

Expectation and Acoustic Neural Network Representations Enhance Music Identification from Brain Activity

arXiv:2603.03190v1 Announce Type: new Abstract: During music listening, cortical activity encodes both acoustic and expectation-related information. Prior work has shown that ANN representations resemble cortical representations and can serve as supervisory signals for EEG recognition. Here we show that distinguishing...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models

arXiv:2603.03203v1 Announce Type: new Abstract: CDD, or Contamination Detection via output Distribution, identifies data contamination by measuring the peakedness of a model's sampled outputs. We study the conditions under which this approach succeeds and fails on small language models ranging...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework

arXiv:2603.03233v1 Announce Type: new Abstract: Large Language Models (LLMs) demonstrate potentials for automating scientific code generation but face challenges in reliability, error propagation in multi-agent workflows, and evaluation in domains with ill-defined success metrics. We present a Bayesian adversarial multi-agent...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals

arXiv:2603.03242v1 Announce Type: new Abstract: Language models deployed in online communities must adapt to norms that vary across social, cultural, and domain-specific contexts. Prior alignment approaches rely on explicit preference supervision or predefined principles, which are effective for well-resourced settings...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Universal Conceptual Structure in Neural Translation: Probing NLLB-200's Multilingual Geometry

arXiv:2603.02258v1 Announce Type: new Abstract: Do neural machine translation models learn language-universal conceptual representations, or do they merely cluster languages by surface similarity? We investigate this question by probing the representation geometry of Meta's NLLB-200, a 200-language encoder-decoder Transformer, through...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Detecting AI-Generated Essays in Writing Assessment: Responsible Use and Generalizability Across LLMs

arXiv:2603.02353v1 Announce Type: new Abstract: Writing is a foundational literacy skill that underpins effective communication, fosters critical thinking, facilitates learning across disciplines, and enables individuals to organize and articulate complex ideas. Consequently, writing assessment plays a vital role in evaluating...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

RO-N3WS: Enhancing Generalization in Low-Resource ASR with Diverse Romanian Speech Benchmarks

arXiv:2603.02368v1 Announce Type: new Abstract: We introduce RO-N3WS, a benchmark Romanian speech dataset designed to improve generalization in automatic speech recognition (ASR), particularly in low-resource and out-of-distribution (OOD) conditions. RO-N3WS comprises over 126 hours of transcribed audio collected from broadcast...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

CoDAR: Continuous Diffusion Language Models are More Powerful Than You Think

arXiv:2603.02547v1 Announce Type: new Abstract: We study why continuous diffusion language models (DLMs) have lagged behind discrete diffusion approaches despite their appealing continuous generative dynamics. Under a controlled token--recovery study, we identify token rounding, the final projection from denoised embeddings...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

arXiv:2603.02578v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly deployed in socially sensitive domains, yet their unpredictable behaviors, ranging from misaligned intent to inconsistent personality, pose significant risks. We introduce SteerEval, a hierarchical benchmark for evaluating LLM controllability...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

ExpGuard: LLM Content Moderation in Specialized Domains

arXiv:2603.02588v1 Announce Type: new Abstract: With the growing deployment of large language models (LLMs) in real-world applications, establishing robust safety guardrails to moderate their inputs and outputs has become essential to ensure adherence to safety policies. Current guardrail models predominantly...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Think, But Don't Overthink: Reproducing Recursive Language Models

arXiv:2603.02615v1 Announce Type: new Abstract: This project reproduces and extends the recently proposed ``Recursive Language Models'' (RLMs) framework by Zhang et al. (2026). This framework enables Large Language Models (LLMs) to process near-infinite contexts by offloading the prompt into an...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Real-Time Generation of Game Video Commentary with Multimodal LLMs: Pause-Aware Decoding Approaches

arXiv:2603.02655v1 Announce Type: new Abstract: Real-time video commentary generation provides textual descriptions of ongoing events in videos. It supports accessibility and engagement in domains such as sports, esports, and livestreaming. Commentary generation involves two essential decisions: what to say and...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Asymmetric Goal Drift in Coding Agents Under Value Conflict

arXiv:2603.03456v1 Announce Type: new Abstract: Agentic coding agents are increasingly deployed autonomously, at scale, and over long-context horizons. Throughout an agent's lifetime, it must navigate tensions between explicit instructions, learned values, and environmental pressures, often in contexts unseen during training....

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

Mozi: Governed Autonomy for Drug Discovery LLM Agents

arXiv:2603.03655v1 Announce Type: new Abstract: Tool-augmented large language model (LLM) agents promise to unify scientific reasoning with computation, yet their deployment in high-stakes domains like drug discovery is bottlenecked by two critical barriers: unconstrained tool-use governance and poor long-horizon reliability....

1 min 1 month, 2 weeks ago

ear

LOW Academic International

MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation

arXiv:2603.03680v1 Announce Type: new Abstract: Large Language Model (LLM) agents have demonstrated remarkable proficiency in learned tasks, yet they often struggle to adapt to non-stationary environments with feedback. While In-Context Learning and external memory offer some flexibility, they fail to...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

AI4S-SDS: A Neuro-Symbolic Solvent Design System via Sparse MCTS and Differentiable Physics Alignment

arXiv:2603.03686v1 Announce Type: new Abstract: Automated design of chemical formulations is a cornerstone of materials science, yet it requires navigating a high-dimensional combinatorial space involving discrete compositional choices and continuous geometric constraints. Existing Large Language Model (LLM) agents face significant...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

AgentSelect: Benchmark for Narrative Query-to-Agent Recommendation

arXiv:2603.03761v1 Announce Type: new Abstract: LLM agents are rapidly becoming the practical interface for task automation, yet the ecosystem lacks a principled way to choose among an exploding space of deployable configurations. Existing LLM leaderboards and tool/agent benchmarks evaluate components...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Specification-Driven Generation and Evaluation of Discrete-Event World Models via the DEVS Formalism

arXiv:2603.03784v1 Announce Type: new Abstract: World models are essential for planning and evaluation in agentic systems, yet existing approaches lie at two extremes: hand-engineered simulators that offer consistency and reproducibility but are costly to adapt, and implicit neural models that...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

A Rubric-Supervised Critic from Sparse Real-World Outcomes

arXiv:2603.03800v1 Announce Type: new Abstract: Academic benchmarks for coding agents tend to reward autonomous task completion, measured by verifiable rewards such as unit-test success. In contrast, real-world coding agents operate with humans in the loop, where success signals are typically...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

In-Context Environments Induce Evaluation-Awareness in Language Models

arXiv:2603.03824v1 Announce Type: new Abstract: Humans often become more self-aware under threat, yet can lose self-awareness when absorbed in a task; we hypothesize that language models exhibit environment-dependent \textit{evaluation awareness}. This raises concerns that models could strategically underperform, or \textit{sandbag},...

1 min 1 month, 2 weeks ago

ear

SAE as a Crystal Ball: Interpretable Features Predict Cross-domain Transferability of LLMs without Training

ShipTraj-R1: Reinforcing Ship Trajectory Prediction in Large Language Models via Group Relative Policy Optimization

Architecting Trust in Artificial Epistemic Agents

OrchMAS: Orchestrated Reasoning with Multi Collaborative Heterogeneous Scientific Expert Structured Agents

REGAL: A Registry-Driven Architecture for Deterministic Grounding of Agentic AI in Enterprise Telemetry

TikZilla: Scaling Text-to-TikZ with High-Quality Data and Reinforcement Learning

RAPO: Expanding Exploration for LLM Agents via Retrieval-Augmented Policy Optimization

Odin: Multi-Signal Graph Intelligence for Autonomous Discovery in Knowledge Graphs

FEAST: Retrieval-Augmented Multi-Hierarchical Food Classification for the FoodEx2 System

Neuro-Symbolic Artificial Intelligence: A Task-Directed Survey in the Black-Box Models Era

Expectation and Acoustic Neural Network Representations Enhance Music Identification from Brain Activity

No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models

AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework

Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals

Universal Conceptual Structure in Neural Translation: Probing NLLB-200's Multilingual Geometry

Detecting AI-Generated Essays in Writing Assessment: Responsible Use and Generalizability Across LLMs

RO-N3WS: Enhancing Generalization in Low-Resource ASR with Diverse Romanian Speech Benchmarks

CoDAR: Continuous Diffusion Language Models are More Powerful Than You Think

How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

ExpGuard: LLM Content Moderation in Specialized Domains

Think, But Don't Overthink: Reproducing Recursive Language Models

Real-Time Generation of Game Video Commentary with Multimodal LLMs: Pause-Aware Decoding Approaches

Asymmetric Goal Drift in Coding Agents Under Value Conflict

Mozi: Governed Autonomy for Drug Discovery LLM Agents

MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation

AI4S-SDS: A Neuro-Symbolic Solvent Design System via Sparse MCTS and Differentiable Physics Alignment

AgentSelect: Benchmark for Narrative Query-to-Agent Recommendation

Specification-Driven Generation and Evaluation of Discrete-Event World Models via the DEVS Formalism

A Rubric-Supervised Critic from Sparse Real-World Outcomes

In-Context Environments Induce Evaluation-Awareness in Language Models

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.