International Law

LOW Academic European Union

Modularity is the Bedrock of Natural and Artificial Intelligence

arXiv:2602.18960v1 Announce Type: new Abstract: The remarkable performance of modern AI systems has been driven by unprecedented scales of data, computation, and energy -- far exceeding the resources required by human intelligence. This disparity highlights the need for new guiding...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Robust and Efficient Tool Orchestration via Layered Execution Structures with Reflective Correction

arXiv:2602.18968v1 Announce Type: new Abstract: Tool invocation is a core capability of agentic systems, yet failures often arise not from individual tool calls but from how multiple tools are organized and executed together. Existing approaches tightly couple tool execution with...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

When Do LLM Preferences Predict Downstream Behavior?

arXiv:2602.18971v1 Announce Type: new Abstract: Preference-driven behavior in LLMs may be a necessary precondition for AI misalignment such as sandbagging: models cannot strategically pursue misaligned goals unless their behavior is influenced by their preferences. Yet prior work has typically prompted...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

How Far Can We Go with Pixels Alone? A Pilot Study on Screen-Only Navigation in Commercial 3D ARPGs

arXiv:2602.18981v1 Announce Type: new Abstract: Modern 3D game levels rely heavily on visual guidance, yet the navigability of level layouts remains difficult to quantify. Prior work either simulates play in simplified environments or analyzes static screenshots for visual affordances, but...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

arXiv:2602.18985v1 Announce Type: new Abstract: Infrared radiation computing underpins advances in climate science, remote sensing and spectroscopy but remains constrained by manual workflows. We introduce InfEngine, an autonomous intelligent computational engine designed to drive a paradigm shift from human-led orchestration...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Quantifying Automation Risk in High-Automation AI Systems: A Bayesian Framework for Failure Propagation and Optimal Oversight

arXiv:2602.18986v1 Announce Type: new Abstract: Organizations across finance, healthcare, transportation, content moderation, and critical infrastructure are rapidly deploying highly automated AI systems, yet they lack principled methods to quantify how increasing automation amplifies harm when failures occur. We propose a...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Benchmark Test-Time Scaling of General LLM Agents

arXiv:2602.18998v1 Announce Type: new Abstract: LLM agents are increasingly expected to function as general-purpose systems capable of resolving open-ended user requests. While existing benchmarks focus on domain-aware environments for developing specialized agents, evaluating general-purpose agents requires more realistic settings that...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Evaluating Large Language Models on Quantum Mechanics: A Comparative Study Across Diverse Models and Tasks

arXiv:2602.19006v1 Announce Type: new Abstract: We present a systematic evaluation of large language models on quantum mechanics problem-solving. Our study evaluates 15 models from five providers (OpenAI, Anthropic, Google, Alibaba, DeepSeek) spanning three capability tiers on 20 tasks covering derivations,...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Asking the Right Questions: Improving Reasoning with Generated Stepping Stones

arXiv:2602.19069v1 Announce Type: new Abstract: Recent years have witnessed tremendous progress in enabling LLMs to solve complex reasoning tasks such as math and coding. As we start to apply LLMs to harder tasks that they may not be able to...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Defining Explainable AI for Requirements Analysis

arXiv:2602.19071v1 Announce Type: new Abstract: Explainable Artificial Intelligence (XAI) has become popular in the last few years. The Artificial Intelligence (AI) community in general, and the Machine Learning (ML) community in particular, is coming to the realisation that in many...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Post-Routing Arithmetic in Llama-3: Last-Token Result Writing and Rotation-Structured Digit Directions

arXiv:2602.19109v1 Announce Type: new Abstract: We study three-digit addition in Meta-Llama-3-8B (base) under a one-token readout to characterize how arithmetic answers are finalized after cross-token routing becomes causally irrelevant. Causal residual patching and cumulative attention ablations localize a sharp boundary...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model

arXiv:2602.19128v1 Announce Type: new Abstract: Optimizing GPU kernels is critical for efficient modern machine learning systems yet remains challenging due to the complex interplay of design factors and rapid hardware evolution. Existing automated approaches typically treat Large Language Models (LLMs)...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

DoAtlas-1: A Causal Compilation Paradigm for Clinical AI

arXiv:2602.19158v1 Announce Type: new Abstract: Medical foundation models generate narrative explanations but cannot quantify intervention effects, detect evidence conflicts, or validate literature claims, limiting clinical auditability. We propose causal compilation, a paradigm that transforms medical evidence from narrative text into...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Beyond Behavioural Trade-Offs: Mechanistic Tracing of Pain-Pleasure Decisions in an LLM

arXiv:2602.19159v1 Announce Type: new Abstract: Prior behavioural work suggests that some LLMs alter choices when options are framed as causing pain or pleasure, and that such deviations can scale with stated intensity. To bridge behavioural evidence (what the model does)...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Reasoning Capabilities of Large Language Models. Lessons Learned from General Game Playing

arXiv:2602.19160v1 Announce Type: new Abstract: This paper examines the reasoning capabilities of Large Language Models (LLMs) from a novel perspective, focusing on their ability to operate within formally specified, rule-governed environments. We evaluate four LLMs (Gemini 2.5 Pro and Flash...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

Characterizing MARL for Energy Control: A Multi-KPI Benchmark on the CityLearn Environment

arXiv:2602.19223v1 Announce Type: new Abstract: The optimization of urban energy systems is crucial for the advancement of sustainable and resilient smart cities, which are becoming increasingly complex with multiple decision-making units. To address scalability and coordination concerns, Multi-Agent Reinforcement Learning...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Robust Exploration in Directed Controller Synthesis via Reinforcement Learning with Soft Mixture-of-Experts

arXiv:2602.19244v1 Announce Type: new Abstract: On-the-fly Directed Controller Synthesis (OTF-DCS) mitigates state-space explosion by incrementally exploring the system and relies critically on an exploration policy to guide search efficiently. Recent reinforcement learning (RL) approaches learn such policies and achieve promising...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Automated Generation of Microfluidic Netlists using Large Language Models

arXiv:2602.19297v1 Announce Type: new Abstract: Microfluidic devices have emerged as powerful tools in various laboratory applications, but the complexity of their design limits accessibility for many practitioners. While progress has been made in microfluidic design automation (MFDA), a practical and...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

ALPACA: A Reinforcement Learning Environment for Medication Repurposing and Treatment Optimization in Alzheimer's Disease

arXiv:2602.19298v1 Announce Type: new Abstract: Evaluating personalized, sequential treatment strategies for Alzheimer's disease (AD) using clinical trials is often impractical due to long disease horizons and substantial inter-patient heterogeneity. To address these constraints, we present the Alzheimer's Learning Platform for...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Time Series, Vision, and Language: Exploring the Limits of Alignment in Contrastive Representation Spaces

arXiv:2602.19367v1 Announce Type: new Abstract: The Platonic Representation Hypothesis posits that learned representations from models trained on different modalities converge to a shared latent structure of the world. However, this hypothesis has largely been examined in vision and language, and...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Artificial Intelligence for Modeling & Simulation in Digital Twins

arXiv:2602.19390v1 Announce Type: new Abstract: The convergence of modeling & simulation (M&S) and artificial intelligence (AI) is leaving its marks on advanced digital technology. Pertinent examples are digital twins (DTs) - high-fidelity, live representations of physical assets, and frequent enablers...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

IR$^3$: Contrastive Inverse Reinforcement Learning for Interpretable Detection and Mitigation of Reward Hacking

arXiv:2602.19416v1 Announce Type: new Abstract: Reinforcement Learning from Human Feedback (RLHF) enables powerful LLM alignment but can introduce reward hacking - models exploit spurious correlations in proxy rewards without genuine alignment. Compounding this, the objectives internalized during RLHF remain opaque,...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Asymptotic Semantic Collapse in Hierarchical Optimization

arXiv:2602.18450v1 Announce Type: new Abstract: Multi-agent language systems can exhibit a failure mode where a shared dominant context progressively absorbs individual semantics, yielding near-uniform behavior across agents. We study this effect under the name Asymptotic Semantic Collapse in Hierarchical Optimization....

1 min 1 month, 2 weeks ago

ear

LOW Academic United Kingdom

From Trial by Fire To Sleep Like a Baby: A Lexicon of Anxiety Associations for 20k English Multiword Expressions

arXiv:2602.18692v1 Announce Type: new Abstract: Anxiety is the unease about a possible future negative outcome. In recent years, there has been growing interest in understanding how anxiety relates to our health, well-being, body, mind, and behaviour. This includes work on...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Semantic Substrate Theory: An Operator-Theoretic Framework for Geometric Semantic Drift

arXiv:2602.18699v1 Announce Type: new Abstract: Most semantic drift studies report multiple signals e.g., embedding displacement, neighbor changes, distributional divergence, and recursive trajectory instability, without a shared explanatory theory that relates them. This paper proposes a formalization of these signals in...

1 min 1 month, 2 weeks ago

icc

LOW Academic European Union

ReHear: Iterative Pseudo-Label Refinement for Semi-Supervised Speech Recognition via Audio Large Language Models

arXiv:2602.18721v1 Announce Type: new Abstract: Semi-supervised learning in automatic speech recognition (ASR) typically relies on pseudo-labeling, which often suffers from confirmation bias and error accumulation due to noisy supervision. To address this limitation, we propose ReHear, a framework for iterative...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

DeepInnovator: Triggering the Innovative Capabilities of LLMs

arXiv:2602.18920v1 Announce Type: new Abstract: The application of Large Language Models (LLMs) in accelerating scientific discovery has garnered increasing attention, with a key focus on constructing research agents endowed with innovative capability, i.e., the ability to autonomously generate novel and...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Why Agent Caching Fails and How to Fix It: Structured Intent Canonicalization with Few-Shot Learning

arXiv:2602.18922v1 Announce Type: new Abstract: Personal AI agents incur substantial cost via repeated LLM calls. We show existing caching methods fail: GPTCache achieves 37.9% accuracy on real benchmarks; APC achieves 0-12%. The root cause is optimizing for the wrong property...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Yor-Sarc: A gold-standard dataset for sarcasm detection in a low-resource African language

arXiv:2602.18964v1 Announce Type: new Abstract: Sarcasm detection poses a fundamental challenge in computational semantics, requiring models to resolve disparities between literal and intended meaning. The challenge is amplified in low-resource languages where annotated datasets are scarce or nonexistent. We present...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Capable but Unreliable: Canonical Path Deviation as a Causal Mechanism of Agent Failure in Long-Horizon Tasks

arXiv:2602.19008v1 Announce Type: new Abstract: Why do language agents fail on tasks they are capable of solving? We argue that many such failures are reliability failures caused by stochastic drift from a task's latent solution structure, not capability failures. Every...

1 min 1 month, 2 weeks ago

ear

Modularity is the Bedrock of Natural and Artificial Intelligence

Robust and Efficient Tool Orchestration via Layered Execution Structures with Reflective Correction

When Do LLM Preferences Predict Downstream Behavior?

How Far Can We Go with Pixels Alone? A Pilot Study on Screen-Only Navigation in Commercial 3D ARPGs

InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

Quantifying Automation Risk in High-Automation AI Systems: A Bayesian Framework for Failure Propagation and Optimal Oversight

Benchmark Test-Time Scaling of General LLM Agents

Evaluating Large Language Models on Quantum Mechanics: A Comparative Study Across Diverse Models and Tasks

Asking the Right Questions: Improving Reasoning with Generated Stepping Stones

Defining Explainable AI for Requirements Analysis

Post-Routing Arithmetic in Llama-3: Last-Token Result Writing and Rotation-Structured Digit Directions

K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model

DoAtlas-1: A Causal Compilation Paradigm for Clinical AI

Beyond Behavioural Trade-Offs: Mechanistic Tracing of Pain-Pleasure Decisions in an LLM

Reasoning Capabilities of Large Language Models. Lessons Learned from General Game Playing

Characterizing MARL for Energy Control: A Multi-KPI Benchmark on the CityLearn Environment

Robust Exploration in Directed Controller Synthesis via Reinforcement Learning with Soft Mixture-of-Experts

Automated Generation of Microfluidic Netlists using Large Language Models

ALPACA: A Reinforcement Learning Environment for Medication Repurposing and Treatment Optimization in Alzheimer's Disease

Time Series, Vision, and Language: Exploring the Limits of Alignment in Contrastive Representation Spaces

Artificial Intelligence for Modeling & Simulation in Digital Twins

IR$^3$: Contrastive Inverse Reinforcement Learning for Interpretable Detection and Mitigation of Reward Hacking

Asymptotic Semantic Collapse in Hierarchical Optimization

From Trial by Fire To Sleep Like a Baby: A Lexicon of Anxiety Associations for 20k English Multiword Expressions

Semantic Substrate Theory: An Operator-Theoretic Framework for Geometric Semantic Drift

ReHear: Iterative Pseudo-Label Refinement for Semi-Supervised Speech Recognition via Audio Large Language Models

DeepInnovator: Triggering the Innovative Capabilities of LLMs

Why Agent Caching Fails and How to Fix It: Structured Intent Canonicalization with Few-Shot Learning

Yor-Sarc: A gold-standard dataset for sarcasm detection in a low-resource African language

Capable but Unreliable: Canonical Path Deviation as a Causal Mechanism of Agent Failure in Long-Horizon Tasks

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.