Arbitration

LOW Academic International

Think Deep, Not Just Long: Measuring LLM Reasoning Effort via Deep-Thinking Tokens

arXiv:2602.13517v1 Announce Type: new Abstract: Large language models (LLMs) have demonstrated impressive reasoning capabilities by scaling test-time compute via long Chain-of-Thought (CoT). However, recent findings suggest that raw token counts are unreliable proxies for reasoning quality: increased generation length does...

1 min 1 month, 1 week ago

bit

LOW Academic International

Context Shapes LLMs Retrieval-Augmented Fact-Checking Effectiveness

arXiv:2602.14044v1 Announce Type: new Abstract: Large language models (LLMs) show strong reasoning abilities across diverse tasks, yet their performance on extended contexts remains inconsistent. While prior research has emphasized mid-context degradation in question answering, this study examines the impact of...

1 min 1 month, 1 week ago

bit

LOW Academic International

Mind the (DH) Gap! A Contrast in Risky Choices Between Reasoning and Conversational LLMs

arXiv:2602.15173v1 Announce Type: new Abstract: The use of large language models either as decision support systems, or in agentic workflows, is rapidly transforming the digital ecosystem. However, the understanding of LLM decision-making under uncertainty remains limited. We initiate a comparative...

1 min 1 month, 1 week ago

bit

LOW Academic European Union

Ontology-Guided Neuro-Symbolic Inference: Grounding Language Models with Mathematical Domain Knowledge

arXiv:2602.17826v1 Announce Type: new Abstract: Language models exhibit fundamental limitations -- hallucination, brittleness, and lack of formal grounding -- that are particularly problematic in high-stakes specialist fields requiring verifiable reasoning. I investigate whether formal domain ontologies can enhance language model...

1 min 1 month, 1 week ago

bit

LOW Academic International

IRPAPERS: A Visual Document Benchmark for Scientific Retrieval and Question Answering

arXiv:2602.17687v1 Announce Type: cross Abstract: AI systems have achieved remarkable success in processing text and relational data, yet visual document processing remains relatively underexplored. Whereas traditional systems require OCR transcriptions to convert these visual documents into text and metadata, recent...

1 min 1 month, 1 week ago

bit

LOW Academic International

A Case Study of Selected PTQ Baselines for Reasoning LLMs on Ascend NPU

arXiv:2602.17693v1 Announce Type: cross Abstract: Post-Training Quantization (PTQ) is crucial for efficient model deployment, yet its effectiveness on Ascend NPU remains under-explored compared to GPU architectures. This paper presents a case study of representative PTQ baselines applied to reasoning-oriented models...

1 min 1 month, 1 week ago

bit

LOW Academic International

Can LLM Safety Be Ensured by Constraining Parameter Regions?

arXiv:2602.17696v1 Announce Type: cross Abstract: Large language models (LLMs) are often assumed to contain ``safety regions'' -- parameter subsets whose modification directly influences safety behaviors. We conduct a systematic evaluation of four safety region identification methods spanning different parameter granularities,...

1 min 1 month, 1 week ago

bit

LOW Academic European Union

ScaleBITS: Scalable Bitwidth Search for Hardware-Aligned Mixed-Precision LLMs

arXiv:2602.17698v1 Announce Type: cross Abstract: Post-training weight quantization is crucial for reducing the memory and inference cost of large language models (LLMs), yet pushing the average precision below 4 bits remains challenging due to highly non-uniform weight sensitivity and the...

1 min 1 month, 1 week ago

bit

LOW Academic European Union

Inelastic Constitutive Kolmogorov-Arnold Networks: A generalized framework for automated discovery of interpretable inelastic material models

arXiv:2602.17750v1 Announce Type: cross Abstract: A key problem of solid mechanics is the identification of the constitutive law of a material, that is, the relation between strain and stress. Machine learning has lead to considerable advances in this field lately....

1 min 1 month, 1 week ago

bit

LOW Academic International

TPRU: Advancing Temporal and Procedural Understanding in Large Multimodal Models

arXiv:2602.18884v1 Announce Type: new Abstract: Multimodal Large Language Models (MLLMs), particularly smaller, deployable variants, exhibit a critical deficiency in understanding temporal and procedural visual data, a bottleneck hindering their application in real-world embodied AI. This gap is largely caused by...

1 min 1 month, 1 week ago

bit

LOW Academic European Union

Modularity is the Bedrock of Natural and Artificial Intelligence

arXiv:2602.18960v1 Announce Type: new Abstract: The remarkable performance of modern AI systems has been driven by unprecedented scales of data, computation, and energy -- far exceeding the resources required by human intelligence. This disparity highlights the need for new guiding...

1 min 1 month, 1 week ago

bit

LOW Academic International

How Far Can We Go with Pixels Alone? A Pilot Study on Screen-Only Navigation in Commercial 3D ARPGs

arXiv:2602.18981v1 Announce Type: new Abstract: Modern 3D game levels rely heavily on visual guidance, yet the navigability of level layouts remains difficult to quantify. Prior work either simulates play in simplified environments or analyzes static screenshots for visual affordances, but...

1 min 1 month, 1 week ago

bit

LOW Academic International

Robust Exploration in Directed Controller Synthesis via Reinforcement Learning with Soft Mixture-of-Experts

arXiv:2602.19244v1 Announce Type: new Abstract: On-the-fly Directed Controller Synthesis (OTF-DCS) mitigates state-space explosion by incrementally exploring the system and relies critically on an exploration policy to guide search efficiently. Recent reinforcement learning (RL) approaches learn such policies and achieve promising...

1 min 1 month, 1 week ago

bit

LOW Academic International

Time Series, Vision, and Language: Exploring the Limits of Alignment in Contrastive Representation Spaces

arXiv:2602.19367v1 Announce Type: new Abstract: The Platonic Representation Hypothesis posits that learned representations from models trained on different modalities converge to a shared latent structure of the world. However, this hypothesis has largely been examined in vision and language, and...

1 min 1 month, 1 week ago

bit

LOW Academic United States

Asymptotic Semantic Collapse in Hierarchical Optimization

arXiv:2602.18450v1 Announce Type: new Abstract: Multi-agent language systems can exhibit a failure mode where a shared dominant context progressively absorbs individual semantics, yielding near-uniform behavior across agents. We study this effect under the name Asymptotic Semantic Collapse in Hierarchical Optimization....

1 min 1 month, 1 week ago

bit

LOW Academic International

The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder

arXiv:2602.18487v1 Announce Type: new Abstract: This paper introduces GLiNER-bi-Encoder, a novel architecture for Named Entity Recognition (NER) that harmonizes zero-shot flexibility with industrial-scale efficiency. While the original GLiNER framework offers strong generalization, its joint-encoding approach suffers from quadratic complexity as...

1 min 1 month, 1 week ago

adr

LOW Academic International

Causal Identification from Counterfactual Data: Completeness and Bounding Results

arXiv:2602.23541v1 Announce Type: new Abstract: Previous work establishing completeness results for $\textit{counterfactual identification}$ has been circumscribed to the setting where the input data belongs to observational or interventional distributions (Layers 1 and 2 of Pearl's Causal Hierarchy), since it was...

1 min 1 month, 1 week ago

bit

LOW Academic International

SleepLM: Natural-Language Intelligence for Human Sleep

arXiv:2602.23605v1 Announce Type: new Abstract: We present SleepLM, a family of sleep-language foundation models that enable human sleep alignment, interpretation, and interaction with natural language. Despite the critical role of sleep, learning-based sleep analysis systems operate in closed label spaces...

1 min 1 month, 1 week ago

bit

LOW Academic United States

The Auton Agentic AI Framework

arXiv:2602.23720v1 Announce Type: new Abstract: The field of Artificial Intelligence is undergoing a transition from Generative AI -- probabilistic generation of text and images -- to Agentic AI, in which autonomous systems execute actions within external environments on behalf of...

1 min 1 month, 1 week ago

enforcement

LOW Academic European Union

RUMAD: Reinforcement-Unifying Multi-Agent Debate

arXiv:2602.23864v1 Announce Type: new Abstract: Multi-agent debate (MAD) systems leverage collective intelligence to enhance reasoning capabilities, yet existing approaches struggle to simultaneously optimize accuracy, consensus formation, and computational efficiency. Static topology methods lack adaptability to task complexity variations, while external...

1 min 1 month, 1 week ago

bit

LOW Academic International

Pessimistic Auxiliary Policy for Offline Reinforcement Learning

arXiv:2602.23974v1 Announce Type: new Abstract: Offline reinforcement learning aims to learn an agent from pre-collected datasets, avoiding unsafe and inefficient real-time interaction. However, inevitable access to out-ofdistribution actions during the learning process introduces approximation errors, causing the error accumulation and...

1 min 1 month, 1 week ago

bit

LOW Academic International

Recycling Failures: Salvaging Exploration in RLVR via Fine-Grained Off-Policy Guidance

arXiv:2602.24110v1 Announce Type: new Abstract: Reinforcement Learning from Verifiable Rewards (RLVR) has emerged as a powerful paradigm for enhancing the complex reasoning capabilities of Large Reasoning Models. However, standard outcome-based supervision suffers from a critical limitation that penalizes trajectories that...

1 min 1 month, 1 week ago

bit

LOW Academic United States

Let There Be Claws: An Early Social Network Analysis of AI Agents on Moltbook

arXiv:2602.20044v1 Announce Type: cross Abstract: Within twelve days of launch, an AI-native social platform exhibits extreme attention concentration, hierarchical role separation, and one-way attention flow, consistent with the hypothesis that stratification in agent ecosystems can emerge rapidly rather than gradually....

1 min 1 month, 1 week ago

bit

LOW Academic International

Long Range Frequency Tuning for QML

arXiv:2602.23409v1 Announce Type: cross Abstract: Quantum machine learning models using angle encoding naturally represent truncated Fourier series, providing universal function approximation capabilities with sufficient circuit depth. For unary fixed-frequency encodings, circuit depth scales as O(omega_max * (omega_max + epsilon^{-2})) with...

1 min 1 month, 1 week ago

bit

LOW Academic International

Human Supervision as an Information Bottleneck: A Unified Theory of Error Floors in Human-Guided Learning

arXiv:2602.23446v1 Announce Type: cross Abstract: Large language models are trained primarily on human-generated data and feedback, yet they exhibit persistent errors arising from annotation noise, subjective preferences, and the limited expressive bandwidth of natural language. We argue that these limitations...

1 min 1 month, 1 week ago

bit

LOW Academic United States

Optimization of Edge Directions and Weights for Mixed Guidance Graphs in Lifelong Multi-Agent Path Finding

arXiv:2602.23468v1 Announce Type: cross Abstract: Multi-Agent Path Finding (MAPF) aims to move agents from their start to goal vertices on a graph. Lifelong MAPF (LMAPF) continuously assigns new goals to agents as they complete current ones. To guide agents' movement...

1 min 1 month, 1 week ago

bit

LOW Academic United States

EmCoop: A Framework and Benchmark for Embodied Cooperation Among LLM Agents

arXiv:2603.00349v1 Announce Type: new Abstract: Real-world scenarios increasingly require multiple embodied agents to collaborate in dynamic environments under embodied constraints, as many tasks exceed the capabilities of any single agent. Recent advances in large language models (LLMs) enable high-level cognitive...

1 min 1 month, 1 week ago

bit

LOW Academic United States

From Goals to Aspects, Revisited: An NFR Pattern Language for Agentic AI Systems

arXiv:2603.00472v1 Announce Type: new Abstract: Agentic AI systems exhibit numerous crosscutting concerns -- security, observability, cost management, fault tolerance -- that are poorly modularized in current implementations, contributing to the high failure rate of AI projects in reaching production. The...

1 min 1 month, 1 week ago

bit

LOW Academic International

AI Runtime Infrastructure

arXiv:2603.00495v1 Announce Type: new Abstract: We introduce AI Runtime Infrastructure, a distinct execution-time layer that operates above the model and below the application, actively observing, reasoning over, and intervening in agent behavior to optimize task success, latency, token efficiency, reliability,...

1 min 1 month, 1 week ago

enforcement

LOW Academic International

Fair in Mind, Fair in Action? A Synchronous Benchmark for Understanding and Generation in UMLLMs

arXiv:2603.00590v1 Announce Type: new Abstract: As artificial intelligence (AI) is increasingly deployed across domains, ensuring fairness has become a core challenge. However, the field faces a "Tower of Babel'' dilemma: fairness metrics abound, yet their underlying philosophical assumptions often conflict,...

1 min 1 month, 1 week ago

bit

Think Deep, Not Just Long: Measuring LLM Reasoning Effort via Deep-Thinking Tokens

Context Shapes LLMs Retrieval-Augmented Fact-Checking Effectiveness

Mind the (DH) Gap! A Contrast in Risky Choices Between Reasoning and Conversational LLMs

Ontology-Guided Neuro-Symbolic Inference: Grounding Language Models with Mathematical Domain Knowledge

IRPAPERS: A Visual Document Benchmark for Scientific Retrieval and Question Answering

A Case Study of Selected PTQ Baselines for Reasoning LLMs on Ascend NPU

Can LLM Safety Be Ensured by Constraining Parameter Regions?

ScaleBITS: Scalable Bitwidth Search for Hardware-Aligned Mixed-Precision LLMs

Inelastic Constitutive Kolmogorov-Arnold Networks: A generalized framework for automated discovery of interpretable inelastic material models

TPRU: Advancing Temporal and Procedural Understanding in Large Multimodal Models

Modularity is the Bedrock of Natural and Artificial Intelligence

How Far Can We Go with Pixels Alone? A Pilot Study on Screen-Only Navigation in Commercial 3D ARPGs

Robust Exploration in Directed Controller Synthesis via Reinforcement Learning with Soft Mixture-of-Experts

Time Series, Vision, and Language: Exploring the Limits of Alignment in Contrastive Representation Spaces

Asymptotic Semantic Collapse in Hierarchical Optimization

The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder

Causal Identification from Counterfactual Data: Completeness and Bounding Results

SleepLM: Natural-Language Intelligence for Human Sleep

The Auton Agentic AI Framework

RUMAD: Reinforcement-Unifying Multi-Agent Debate

Pessimistic Auxiliary Policy for Offline Reinforcement Learning

Recycling Failures: Salvaging Exploration in RLVR via Fine-Grained Off-Policy Guidance

Let There Be Claws: An Early Social Network Analysis of AI Agents on Moltbook

Long Range Frequency Tuning for QML

Human Supervision as an Information Bottleneck: A Unified Theory of Error Floors in Human-Guided Learning

Optimization of Edge Directions and Weights for Mixed Guidance Graphs in Lifelong Multi-Agent Path Finding

EmCoop: A Framework and Benchmark for Embodied Cooperation Among LLM Agents

From Goals to Aspects, Revisited: An NFR Pattern Language for Agentic AI Systems

AI Runtime Infrastructure

Fair in Mind, Fair in Action? A Synchronous Benchmark for Understanding and Generation in UMLLMs

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.