International Law

LOW Academic European Union

BotzoneBench: Scalable LLM Evaluation via Graded AI Anchors

arXiv:2602.13214v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly deployed in interactive environments requiring strategic decision-making, yet systematic evaluation of these capabilities remains challenging. Existing benchmarks for LLMs primarily assess static reasoning through isolated tasks and fail to...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

When to Think Fast and Slow? AMOR: Entropy-Based Metacognitive Gate for Dynamic SSM-Attention Switching

arXiv:2602.13215v1 Announce Type: new Abstract: Transformers allocate uniform computation to every position, regardless of difficulty. State Space Models (SSMs) offer efficient alternatives but struggle with precise information retrieval over a long horizon. Inspired by dual-process theories of cognition (Kahneman, 2011),...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

VeRA: Verified Reasoning Data Augmentation at Scale

arXiv:2602.13217v1 Announce Type: new Abstract: The main issue with most evaluation schemes today is their "static" nature: the same problems are reused repeatedly, allowing for memorization, format exploitation, and eventual saturation. To measure genuine AI progress, we need evaluation that...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Scaling the Scaling Logic: Agentic Meta-Synthesis of Logic Reasoning

arXiv:2602.13218v1 Announce Type: new Abstract: Scaling verifiable training signals remains a key bottleneck for Reinforcement Learning from Verifiable Rewards (RLVR). Logical reasoning is a natural substrate: constraints are formal and answers are programmatically checkable. However, prior synthesis pipelines either depend...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Intelligence as Trajectory-Dominant Pareto Optimization

arXiv:2602.13230v1 Announce Type: new Abstract: Despite recent advances in artificial intelligence, many systems exhibit stagnation in long-horizon adaptability despite continued performance optimization. This work argues that such limitations do not primarily arise from insufficient learning, data, or model capacity, but...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Lang2Act: Fine-Grained Visual Reasoning through Self-Emergent Linguistic Toolchains

arXiv:2602.13235v1 Announce Type: new Abstract: Visual Retrieval-Augmented Generation (VRAG) enhances Vision-Language Models (VLMs) by incorporating external visual documents to address a given query. Existing VRAG frameworks usually depend on rigid, pre-defined external tools to extend the perceptual capabilities of VLMs,...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

NL2LOGIC: AST-Guided Translation of Natural Language into First-Order Logic with Large Language Models

arXiv:2602.13237v1 Announce Type: new Abstract: Automated reasoning is critical in domains such as law and governance, where verifying claims against facts in documents requires both accuracy and interpretability. Recent work adopts structured reasoning pipelines that translate natural language into first-order...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

X-Blocks: Linguistic Building Blocks of Natural Language Explanations for Automated Vehicles

arXiv:2602.13248v1 Announce Type: new Abstract: Natural language explanations play a critical role in establishing trust and acceptance of automated vehicles (AVs), yet existing approaches lack systematic frameworks for analysing how humans linguistically construct driving rationales across diverse scenarios. This paper...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

MAPLE: A Sub-Agent Architecture for Memory, Learning, and Personalization in Agentic AI Systems

arXiv:2602.13258v1 Announce Type: new Abstract: Large language model (LLM) agents have emerged as powerful tools for complex tasks, yet their ability to adapt to individual users remains fundamentally limited. We argue this limitation stems from a critical architectural conflation: current...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

General learned delegation by clones

arXiv:2602.13262v1 Announce Type: new Abstract: Frontier language models improve with additional test-time computation, but serial reasoning or uncoordinated parallel sampling can be compute-inefficient under fixed inference budgets. We propose SELFCEST, which equips a base model with the ability to spawn...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Human-Centered Explainable AI for Security Enhancement: A Deep Intrusion Detection Framework

arXiv:2602.13271v1 Announce Type: new Abstract: The increasing complexity and frequency of cyber-threats demand intrusion detection systems (IDS) that are not only accurate but also interpretable. This paper presented a novel IDS framework that integrated Explainable Artificial Intelligence (XAI) to enhance...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

TemporalBench: A Benchmark for Evaluating LLM-Based Agents on Contextual and Event-Informed Time Series Tasks

arXiv:2602.13272v1 Announce Type: new Abstract: It is unclear whether strong forecasting performance reflects genuine temporal understanding or the ability to reason under contextual and event-driven conditions. We introduce TemporalBench, a multi-domain benchmark designed to evaluate temporal reasoning behavior under progressively...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Artificial Organisations

arXiv:2602.13275v1 Announce Type: new Abstract: Alignment research focuses on making individual AI systems reliable. Human institutions achieve reliable collective behaviour differently: they mitigate the risk posed by misaligned individuals through organisational structure. Multi-agent AI systems should follow this institutional model...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

BEAGLE: Behavior-Enforced Agent for Grounded Learner Emulation

arXiv:2602.13280v1 Announce Type: new Abstract: Simulating student learning behaviors in open-ended problem-solving environments holds potential for education research, from training adaptive tutoring systems to stress-testing pedagogical interventions. However, collecting authentic data is challenging due to privacy concerns and the high...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Mirror: A Multi-Agent System for AI-Assisted Ethics Review

arXiv:2602.13292v1 Announce Type: new Abstract: Ethics review is a foundational mechanism of modern research governance, yet contemporary systems face increasing strain as ethical risks arise as structural consequences of large-scale, interdisciplinary scientific practice. The demand for consistent and defensible decisions...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Situation Graph Prediction: Structured Perspective Inference for User Modeling

arXiv:2602.13319v1 Announce Type: new Abstract: Perspective-Aware AI requires modeling evolving internal states--goals, emotions, contexts--not merely preferences. Progress is limited by a data bottleneck: digital footprints are privacy-sensitive and perspective states are rarely labeled. We propose Situation Graph Prediction (SGP), a...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Information Fidelity in Tool-Using LLM Agents: A Martingale Analysis of the Model Context Protocol

arXiv:2602.13320v1 Announce Type: new Abstract: As AI agents powered by large language models (LLMs) increasingly use external tools for high-stakes decisions, a critical reliability question arises: how do errors propagate across sequential tool calls? We introduce the first theoretical framework...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Detecting Jailbreak Attempts in Clinical Training LLMs Through Automated Linguistic Feature Extraction

arXiv:2602.13321v1 Announce Type: new Abstract: Detecting jailbreak attempts in clinical training large language models (LLMs) requires accurate modeling of linguistic deviations that signal unsafe or off-task user behavior. Prior work on the 2-Sigma clinical simulation platform showed that manually annotated...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Contrastive explanations of BDI agents

arXiv:2602.13323v1 Announce Type: new Abstract: The ability of autonomous systems to provide explanations is important for supporting transparency and aiding the development of (appropriate) trust. Prior work has defined a mechanism for Belief-Desire-Intention (BDI) agents to be able to answer...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

arXiv:2602.13367v1 Announce Type: new Abstract: We present Nanbeige4.1-3B, a unified generalist language model that simultaneously achieves strong agentic behavior, code generation, and general reasoning with only 3B parameters. To the best of our knowledge, it is the first open-source small...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

On-Policy Supervised Fine-Tuning for Efficient Reasoning

arXiv:2602.13407v1 Announce Type: new Abstract: Large reasoning models (LRMs) are commonly trained with reinforcement learning (RL) to explore long chain-of-thought reasoning, achieving strong performance at high computational cost. Recent methods add multi-reward objectives to jointly optimize correctness and brevity, but...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

NeuroWeaver: An Autonomous Evolutionary Agent for Exploring the Programmatic Space of EEG Analysis Pipelines

arXiv:2602.13473v1 Announce Type: new Abstract: Although foundation models have demonstrated remarkable success in general domains, the application of these models to electroencephalography (EEG) analysis is constrained by substantial data requirements and high parameterization. These factors incur prohibitive computational costs, thereby...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

OMNI-LEAK: Orchestrator Multi-Agent Network Induced Data Leakage

arXiv:2602.13477v1 Announce Type: new Abstract: As Large Language Model (LLM) agents become more capable, their coordinated use in the form of multi-agent systems is anticipated to emerge as a practical paradigm. Prior work has examined the safety and misuse risks...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

OpAgent: Operator Agent for Web Navigation

arXiv:2602.13559v1 Announce Type: new Abstract: To fulfill user instructions, autonomous web agents must contend with the inherent complexity and volatile nature of real-world websites. Conventional paradigms predominantly rely on Supervised Fine-Tuning (SFT) or Offline Reinforcement Learning (RL) using static datasets....

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

Differentiable Rule Induction from Raw Sequence Inputs

arXiv:2602.13583v1 Announce Type: new Abstract: Rule learning-based models are widely used in highly interpretable scenarios due to their transparent structures. Inductive logic programming (ILP), a form of machine learning, induces rules from facts while maintaining interpretability. Differentiable ILP models enhance...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

A First Proof Sprint

arXiv:2602.13587v1 Announce Type: new Abstract: This monograph reports a multi-agent proof sprint on ten research-level problems, combining rapid draft generation with adversarial verification, targeted repair, and explicit provenance. The workflow uses wiring-diagram decompositions of claim dependencies to localize gaps and...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Hippocampus: An Efficient and Scalable Memory Module for Agentic AI

arXiv:2602.13594v1 Announce Type: new Abstract: Agentic AI require persistent memory to store user-specific histories beyond the limited context window of LLMs. Existing memory systems use dense vector databases or knowledge-graph traversal (or hybrid), incurring high retrieval latency and poor storage...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

The Quantization Trap: Breaking Linear Scaling Laws in Multi-Hop Reasoning

arXiv:2602.13595v1 Announce Type: new Abstract: Neural scaling laws provide a predictable recipe for AI advancement: reducing numerical precision should linearly improve computational efficiency and energy profile (E proportional to bits). In this paper, we demonstrate that this scaling law breaks...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Guided Collaboration in Heterogeneous LLM-Based Multi-Agent Systems via Entropy-Based Understanding Assessment and Experience Retrieval

arXiv:2602.13639v1 Announce Type: new Abstract: With recent breakthroughs in large language models (LLMs) for reasoning, planning, and complex task generation, artificial intelligence systems are transitioning from isolated single-agent architectures to multi-agent systems with collaborative intelligence. However, in heterogeneous multi-agent systems...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Building Autonomous GUI Navigation via Agentic-Q Estimation and Step-Wise Policy Optimization

arXiv:2602.13653v1 Announce Type: new Abstract: Recent advances in Multimodal Large Language Models (MLLMs) have substantially driven the progress of autonomous agents for Graphical User Interface (GUI). Nevertheless, in real-world applications, GUI agents are often faced with non-stationary environments, leading to...

1 min 1 month, 2 weeks ago

ear

BotzoneBench: Scalable LLM Evaluation via Graded AI Anchors

When to Think Fast and Slow? AMOR: Entropy-Based Metacognitive Gate for Dynamic SSM-Attention Switching

VeRA: Verified Reasoning Data Augmentation at Scale

Scaling the Scaling Logic: Agentic Meta-Synthesis of Logic Reasoning

Intelligence as Trajectory-Dominant Pareto Optimization

Lang2Act: Fine-Grained Visual Reasoning through Self-Emergent Linguistic Toolchains

NL2LOGIC: AST-Guided Translation of Natural Language into First-Order Logic with Large Language Models

X-Blocks: Linguistic Building Blocks of Natural Language Explanations for Automated Vehicles

MAPLE: A Sub-Agent Architecture for Memory, Learning, and Personalization in Agentic AI Systems

General learned delegation by clones

Human-Centered Explainable AI for Security Enhancement: A Deep Intrusion Detection Framework

TemporalBench: A Benchmark for Evaluating LLM-Based Agents on Contextual and Event-Informed Time Series Tasks

Artificial Organisations

BEAGLE: Behavior-Enforced Agent for Grounded Learner Emulation

Mirror: A Multi-Agent System for AI-Assisted Ethics Review

Situation Graph Prediction: Structured Perspective Inference for User Modeling

Information Fidelity in Tool-Using LLM Agents: A Martingale Analysis of the Model Context Protocol

Detecting Jailbreak Attempts in Clinical Training LLMs Through Automated Linguistic Feature Extraction

Contrastive explanations of BDI agents

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

On-Policy Supervised Fine-Tuning for Efficient Reasoning

NeuroWeaver: An Autonomous Evolutionary Agent for Exploring the Programmatic Space of EEG Analysis Pipelines

OMNI-LEAK: Orchestrator Multi-Agent Network Induced Data Leakage

OpAgent: Operator Agent for Web Navigation

Differentiable Rule Induction from Raw Sequence Inputs

A First Proof Sprint

Hippocampus: An Efficient and Scalable Memory Module for Agentic AI

The Quantization Trap: Breaking Linear Scaling Laws in Multi-Hop Reasoning

Guided Collaboration in Heterogeneous LLM-Based Multi-Agent Systems via Entropy-Based Understanding Assessment and Experience Retrieval

Building Autonomous GUI Navigation via Agentic-Q Estimation and Step-Wise Policy Optimization

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.