International Law

LOW Academic International

Causal Identification from Counterfactual Data: Completeness and Bounding Results

arXiv:2602.23541v1 Announce Type: new Abstract: Previous work establishing completeness results for $\textit{counterfactual identification}$ has been circumscribed to the setting where the input data belongs to observational or interventional distributions (Layers 1 and 2 of Pearl's Causal Hierarchy), since it was...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Planning under Distribution Shifts with Causal POMDPs

arXiv:2602.23545v1 Announce Type: new Abstract: In the real world, planning is often challenged by distribution shifts. As such, a model of the environment obtained under one set of conditions may no longer remain valid as the distribution of states or...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem

arXiv:2602.23579v1 Announce Type: new Abstract: The Multiple Traveling Salesman Problem (mTSP) extends the Traveling Salesman Problem to m tours that start and end at a common depot and jointly visit all customers exactly once. In the min-max variant, the objective...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

SleepLM: Natural-Language Intelligence for Human Sleep

arXiv:2602.23605v1 Announce Type: new Abstract: We present SleepLM, a family of sleep-language foundation models that enable human sleep alignment, interpretation, and interaction with natural language. Despite the critical role of sleep, learning-based sleep analysis systems operate in closed label spaces...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

AI Must Embrace Specialization via Superhuman Adaptable Intelligence

arXiv:2602.23643v1 Announce Type: new Abstract: Everyone from AI executives and researchers to doomsayers, politicians, and activists is talking about Artificial General Intelligence (AGI). Yet, they often don't seem to agree on its exact definition. One common definition of AGI is...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

From Flat Logs to Causal Graphs: Hierarchical Failure Attribution for LLM-based Multi-Agent Systems

arXiv:2602.23701v1 Announce Type: new Abstract: LLM-powered Multi-Agent Systems (MAS) have demonstrated remarkable capabilities in complex domains but suffer from inherent fragility and opaque failure mechanisms. Existing failure attribution methods, whether relying on direct prompting, costly replays, or supervised fine-tuning, typically...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

ProductResearch: Training E-Commerce Deep Research Agents via Multi-Agent Synthetic Trajectory Distillation

arXiv:2602.23716v1 Announce Type: new Abstract: Large Language Model (LLM)-based agents show promise for e-commerce conversational shopping, yet existing implementations lack the interaction depth and contextual breadth required for complex product research. Meanwhile, the Deep Research paradigm, despite advancing information synthesis...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

The Auton Agentic AI Framework

arXiv:2602.23720v1 Announce Type: new Abstract: The field of Artificial Intelligence is undergoing a transition from Generative AI -- probabilistic generation of text and images -- to Agentic AI, in which autonomous systems execute actions within external environments on behalf of...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Reasoning-Driven Multimodal LLM for Domain Generalization

arXiv:2602.23777v1 Announce Type: new Abstract: This paper addresses the domain generalization (DG) problem in deep learning. While most DG methods focus on enforcing visual feature invariance, we leverage the reasoning capability of multimodal large language models (MLLMs) and explore the...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

EMO-R3: Reflective Reinforcement Learning for Emotional Reasoning in Multimodal Large Language Models

arXiv:2602.23802v1 Announce Type: new Abstract: Multimodal Large Language Models (MLLMs) have shown remarkable progress in visual reasoning and understanding tasks but still struggle to capture the complexity and subjectivity of human emotions. Existing approaches based on supervised fine-tuning often suffer...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

RUMAD: Reinforcement-Unifying Multi-Agent Debate

arXiv:2602.23864v1 Announce Type: new Abstract: Multi-agent debate (MAD) systems leverage collective intelligence to enhance reasoning capabilities, yet existing approaches struggle to simultaneously optimize accuracy, consensus formation, and computational efficiency. Static topology methods lack adaptability to task complexity variations, while external...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

RF-Agent: Automated Reward Function Design via Language Agent Tree Search

arXiv:2602.23876v1 Announce Type: new Abstract: Designing efficient reward functions for low-level control tasks is a challenging problem. Recent research aims to reduce reliance on expert experience by using Large Language Models (LLMs) with task information to generate dense reward functions....

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Pessimistic Auxiliary Policy for Offline Reinforcement Learning

arXiv:2602.23974v1 Announce Type: new Abstract: Offline reinforcement learning aims to learn an agent from pre-collected datasets, avoiding unsafe and inefficient real-time interaction. However, inevitable access to out-ofdistribution actions during the learning process introduces approximation errors, causing the error accumulation and...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Portfolio Reinforcement Learning with Scenario-Context Rollout

arXiv:2602.24037v1 Announce Type: new Abstract: Market regime shifts induce distribution shifts that can degrade the performance of portfolio rebalancing policies. We propose macro-conditioned scenario-context rollout (SCR) that generates plausible next-day multivariate return scenarios under stress events. However, doing so faces...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Human or Machine? A Preliminary Turing Test for Speech-to-Speech Interaction

arXiv:2602.24080v1 Announce Type: new Abstract: The pursuit of human-like conversational agents has long been guided by the Turing test. For modern speech-to-speech (S2S) systems, a critical yet unanswered question is whether they can converse like humans. To tackle this, we...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

Bi-level RL-Heuristic Optimization for Real-world Winter Road Maintenance

arXiv:2602.24097v1 Announce Type: new Abstract: Winter road maintenance is critical for ensuring public safety and reducing environmental impacts, yet existing methods struggle to manage large-scale routing problems effectively and mostly reply on human decision. This study presents a novel, scalable...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Artificial Agency Program: Curiosity, compression, and communication in agents

arXiv:2602.24100v1 Announce Type: new Abstract: This paper presents the Artificial Agency Program (AAP), a position and research agenda for building AI systems as reality embedded, resource-bounded agents whose development is driven by curiosity-as-learning-progress under physical and computational constraints. The central...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Recycling Failures: Salvaging Exploration in RLVR via Fine-Grained Off-Policy Guidance

arXiv:2602.24110v1 Announce Type: new Abstract: Reinforcement Learning from Verifiable Rewards (RLVR) has emerged as a powerful paradigm for enhancing the complex reasoning capabilities of Large Reasoning Models. However, standard outcome-based supervision suffers from a critical limitation that penalizes trajectories that...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

LemmaBench: A Live, Research-Level Benchmark to Evaluate LLM Capabilities in Mathematics

arXiv:2602.24173v1 Announce Type: new Abstract: We present a new approach for benchmarking Large Language Model (LLM) capabilities on research-level mathematics. Existing benchmarks largely rely on static, hand-curated sets of contest or textbook-style problems as proxies for mathematical research. Instead, we...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

Learning Flexible Job Shop Scheduling under Limited Buffers and Material Kitting Constraints

arXiv:2602.24180v1 Announce Type: new Abstract: The Flexible Job Shop Scheduling Problem (FJSP) originates from real production lines, while some practical constraints are often ignored or idealized in current FJSP studies, among which the limited buffer problem has a particular impact...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

A Minimal Agent for Automated Theorem Proving

arXiv:2602.24273v1 Announce Type: new Abstract: We propose a minimal agentic baseline that enables systematic comparison across different AI-based theorem prover architectures. This design implements the core features shared among state-of-the-art systems: iterative proof refinement, library search and context management. We...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science

arXiv:2602.24288v1 Announce Type: new Abstract: The fast-growing demands in using Large Language Models (LLMs) to tackle complex multi-step data science tasks create an emergent need for accurate benchmarking. There are two major gaps in existing benchmarks: (i) the lack of...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

QD-MAPPER: A Quality Diversity Framework to Automatically Evaluate Multi-Agent Path Finding Algorithms in Diverse Maps

arXiv:2409.06888v5 Announce Type: cross Abstract: We use the Quality Diversity (QD) algorithm with Neural Cellular Automata (NCA) to automatically evaluate Multi-Agent Path Finding (MAPF) algorithms by generating diverse maps. Previously, researchers typically evaluate MAPF algorithms on a set of specific,...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Let There Be Claws: An Early Social Network Analysis of AI Agents on Moltbook

arXiv:2602.20044v1 Announce Type: cross Abstract: Within twelve days of launch, an AI-native social platform exhibits extreme attention concentration, hierarchical role separation, and one-way attention flow, consistent with the hypothesis that stratification in agent ecosystems can emerge rapidly rather than gradually....

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Keyword search is all you need: Achieving RAG-Level Performance without vector databases using agentic tool use

arXiv:2602.23368v1 Announce Type: cross Abstract: While Retrieval-Augmented Generation (RAG) has proven effective for generating accurate, context-based responses based on existing knowledge bases, it presents several challenges including retrieval quality dependencies, integration complexity and cost. Recent advances in agentic-RAG and tool-augmented...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Toward General Semantic Chunking: A Discriminative Framework for Ultra-Long Documents

arXiv:2602.23370v1 Announce Type: cross Abstract: Long-document topic segmentation plays an important role in information retrieval and document understanding, yet existing methods still show clear shortcomings in ultra-long text settings. Traditional discriminative models are constrained by fixed windows and cannot model...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Domain-Partitioned Hybrid RAG for Legal Reasoning: Toward Modular and Explainable Legal AI for India

arXiv:2602.23371v1 Announce Type: cross Abstract: Legal research in India involves navigating long and heterogeneous documents spanning statutes, constitutional provisions, penal codes, and judicial precedents, where purely keyword-based or embedding-only retrieval systems often fail to support structured legal reasoning. Recent retrieval...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Democratizing GraphRAG: Linear, CPU-Only Graph Retrieval for Multi-Hop QA

arXiv:2602.23372v1 Announce Type: cross Abstract: GraphRAG systems improve multi-hop retrieval by modeling structure, but many approaches rely on expensive LLM-based graph construction and GPU-heavy inference. We present SPRIG (Seeded Propagation for Retrieval In Graphs), a CPU-only, linear-time, token-free GraphRAG pipeline...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Now You See Me: Designing Responsible AI Dashboards for Early-Stage Health Innovation

arXiv:2602.23378v1 Announce Type: cross Abstract: Innovative HealthTech teams develop Artificial Intelligence (AI) systems in contexts where ethical expectations and organizational priorities must be balanced under severe resource constraints. While Responsible AI practices are expected to guide the design and evaluation...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Task-Lens: Cross-Task Utility Based Speech Dataset Profiling for Low-Resource Indian Languages

arXiv:2602.23388v1 Announce Type: cross Abstract: The rising demand for inclusive speech technologies amplifies the need for multilingual datasets for Natural Language Processing (NLP) research. However, limited awareness of existing task-specific resources in low-resource languages hinders research. This challenge is especially...

1 min 1 month, 2 weeks ago

ear

Causal Identification from Counterfactual Data: Completeness and Bounding Results

Planning under Distribution Shifts with Causal POMDPs

Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem

SleepLM: Natural-Language Intelligence for Human Sleep

AI Must Embrace Specialization via Superhuman Adaptable Intelligence

From Flat Logs to Causal Graphs: Hierarchical Failure Attribution for LLM-based Multi-Agent Systems

ProductResearch: Training E-Commerce Deep Research Agents via Multi-Agent Synthetic Trajectory Distillation

The Auton Agentic AI Framework

Reasoning-Driven Multimodal LLM for Domain Generalization

EMO-R3: Reflective Reinforcement Learning for Emotional Reasoning in Multimodal Large Language Models

RUMAD: Reinforcement-Unifying Multi-Agent Debate

RF-Agent: Automated Reward Function Design via Language Agent Tree Search

Pessimistic Auxiliary Policy for Offline Reinforcement Learning

Portfolio Reinforcement Learning with Scenario-Context Rollout

Human or Machine? A Preliminary Turing Test for Speech-to-Speech Interaction

Bi-level RL-Heuristic Optimization for Real-world Winter Road Maintenance

Artificial Agency Program: Curiosity, compression, and communication in agents

Recycling Failures: Salvaging Exploration in RLVR via Fine-Grained Off-Policy Guidance

LemmaBench: A Live, Research-Level Benchmark to Evaluate LLM Capabilities in Mathematics

Learning Flexible Job Shop Scheduling under Limited Buffers and Material Kitting Constraints

A Minimal Agent for Automated Theorem Proving

DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science

QD-MAPPER: A Quality Diversity Framework to Automatically Evaluate Multi-Agent Path Finding Algorithms in Diverse Maps

Let There Be Claws: An Early Social Network Analysis of AI Agents on Moltbook

Keyword search is all you need: Achieving RAG-Level Performance without vector databases using agentic tool use

Toward General Semantic Chunking: A Discriminative Framework for Ultra-Long Documents

Domain-Partitioned Hybrid RAG for Legal Reasoning: Toward Modular and Explainable Legal AI for India

Democratizing GraphRAG: Linear, CPU-Only Graph Retrieval for Multi-Hop QA

Now You See Me: Designing Responsible AI Dashboards for Early-Stage Health Innovation

Task-Lens: Cross-Task Utility Based Speech Dataset Profiling for Low-Resource Indian Languages

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.