Litigation

LOW Academic International

Rethinking Retrieval-Augmented Generation as a Cooperative Decision-Making Problem

arXiv:2602.18734v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) has demonstrated strong effectiveness in knowledge-intensive tasks by grounding language generation in external evidence. Despite its success, many existing RAG systems are built based on a ranking-centric, asymmetric dependency paradigm, where the...

1 min 1 month, 1 week ago

evidence

LOW Academic International

BURMESE-SAN: Burmese NLP Benchmark for Evaluating Large Language Models

arXiv:2602.18788v1 Announce Type: new Abstract: We introduce BURMESE-SAN, the first holistic benchmark that systematically evaluates large language models (LLMs) for Burmese across three core NLP competencies: understanding (NLU), reasoning (NLR), and generation (NLG). BURMESE-SAN consolidates seven subtasks spanning these competencies,...

1 min 1 month, 1 week ago

standing

LOW Academic International

SleepLM: Natural-Language Intelligence for Human Sleep

arXiv:2602.23605v1 Announce Type: new Abstract: We present SleepLM, a family of sleep-language foundation models that enable human sleep alignment, interpretation, and interaction with natural language. Despite the critical role of sleep, learning-based sleep analysis systems operate in closed label spaces...

1 min 1 month, 1 week ago

standing

LOW Academic International

RF-Agent: Automated Reward Function Design via Language Agent Tree Search

arXiv:2602.23876v1 Announce Type: new Abstract: Designing efficient reward functions for low-level control tasks is a challenging problem. Recent research aims to reduce reliance on expert experience by using Large Language Models (LLMs) with task information to generate dense reward functions....

1 min 1 month, 1 week ago

standing

LOW Academic International

Artificial Agency Program: Curiosity, compression, and communication in agents

arXiv:2602.24100v1 Announce Type: new Abstract: This paper presents the Artificial Agency Program (AAP), a position and research agenda for building AI systems as reality embedded, resource-bounded agents whose development is driven by curiosity-as-learning-progress under physical and computational constraints. The central...

1 min 1 month, 1 week ago

standing

LOW Academic International

Keyword search is all you need: Achieving RAG-Level Performance without vector databases using agentic tool use

arXiv:2602.23368v1 Announce Type: cross Abstract: While Retrieval-Augmented Generation (RAG) has proven effective for generating accurate, context-based responses based on existing knowledge bases, it presents several challenges including retrieval quality dependencies, integration complexity and cost. Recent advances in agentic-RAG and tool-augmented...

1 min 1 month, 1 week ago

standing

LOW Academic International

Toward General Semantic Chunking: A Discriminative Framework for Ultra-Long Documents

arXiv:2602.23370v1 Announce Type: cross Abstract: Long-document topic segmentation plays an important role in information retrieval and document understanding, yet existing methods still show clear shortcomings in ultra-long text settings. Traditional discriminative models are constrained by fixed windows and cannot model...

1 min 1 month, 1 week ago

standing

LOW Academic United States

Domain-Partitioned Hybrid RAG for Legal Reasoning: Toward Modular and Explainable Legal AI for India

arXiv:2602.23371v1 Announce Type: cross Abstract: Legal research in India involves navigating long and heterogeneous documents spanning statutes, constitutional provisions, penal codes, and judicial precedents, where purely keyword-based or embedding-only retrieval systems often fail to support structured legal reasoning. Recent retrieval...

1 min 1 month, 1 week ago

evidence

LOW Academic International

CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era

arXiv:2602.23452v1 Announce Type: new Abstract: Scientific research relies on accurate citation for attribution and integrity, yet large language models (LLMs) introduce a new risk: fabricated references that appear plausible but correspond to no real publications. Such hallucinated citations have already...

1 min 1 month, 1 week ago

evidence

LOW Academic International

Multi-Sourced, Multi-Agent Evidence Retrieval for Fact-Checking

arXiv:2603.00267v1 Announce Type: new Abstract: Misinformation spreading over the Internet poses a significant threat to both societies and individuals, necessitating robust and scalable fact-checking that relies on retrieving accurate and trustworthy evidence. Previous methods rely on semantic and social-contextual patterns...

1 min 1 month, 1 week ago

evidence

LOW Academic European Union

DIG to Heal: Scaling General-purpose Agent Collaboration via Explainable Dynamic Decision Paths

arXiv:2603.00309v1 Announce Type: new Abstract: The increasingly popular agentic AI paradigm promises to harness the power of multiple, general-purpose large language model (LLM) agents to collaboratively complete complex tasks. While many agentic AI systems utilize predefined workflows or agent roles...

1 min 1 month, 1 week ago

standing

LOW Academic International

Conservative Equilibrium Discovery in Offline Game-Theoretic Multiagent Reinforcement Learning

arXiv:2603.00374v1 Announce Type: new Abstract: Offline learning of strategies takes data efficiency to its extreme by restricting algorithms to a fixed dataset of state-action trajectories. We consider the problem in a mixed-motive multiagent setting, where the goal is to solve...

1 min 1 month, 1 week ago

discovery

LOW Academic United States

Confusion-Aware Rubric Optimization for LLM-based Automated Grading

arXiv:2603.00451v1 Announce Type: new Abstract: Accurate and unambiguous guidelines are critical for large language model (LLM) based graders, yet manually crafting these prompts is often sub-optimal as LLMs can misinterpret expert guidelines or lack necessary domain specificity. Consequently, the field...

1 min 1 month, 1 week ago

trial

LOW Academic International

MED-COPILOT: A Medical Assistant Powered by GraphRAG and Similar Patient Case Retrieval

arXiv:2603.00460v1 Announce Type: new Abstract: Clinical decision-making requires synthesizing heterogeneous evidence, including patient histories, clinical guidelines, and trajectories of comparable cases. While large language models (LLMs) offer strong reasoning capabilities, they remain prone to hallucinations and struggle to integrate long,...

1 min 1 month, 1 week ago

evidence

LOW Academic United States

From Goals to Aspects, Revisited: An NFR Pattern Language for Agentic AI Systems

arXiv:2603.00472v1 Announce Type: new Abstract: Agentic AI systems exhibit numerous crosscutting concerns -- security, observability, cost management, fault tolerance -- that are poorly modularized in current implementations, contributing to the high failure rate of AI projects in reaching production. The...

1 min 1 month, 1 week ago

discovery

LOW Academic International

LifeEval: A Multimodal Benchmark for Assistive AI in Egocentric Daily Life Tasks

arXiv:2603.00490v1 Announce Type: new Abstract: The rapid progress of Multimodal Large Language Models (MLLMs) marks a significant step toward artificial general intelligence, offering great potential for augmenting human capabilities. However, their ability to provide effective assistance in dynamic, real-world environments...

1 min 1 month, 1 week ago

standing

LOW Academic International

MicroVerse: A Preliminary Exploration Toward a Micro-World Simulation

arXiv:2603.00585v1 Announce Type: new Abstract: Recent advances in video generation have opened new avenues for macroscopic simulation of complex dynamic systems, but their application to microscopic phenomena remains largely unexplored. Microscale simulation holds great promise for biomedical applications such as...

1 min 1 month, 1 week ago

discovery

LOW Academic International

Fair in Mind, Fair in Action? A Synchronous Benchmark for Understanding and Generation in UMLLMs

arXiv:2603.00590v1 Announce Type: new Abstract: As artificial intelligence (AI) is increasingly deployed across domains, ensuring fairness has become a core challenge. However, the field faces a "Tower of Babel'' dilemma: fairness metrics abound, yet their underlying philosophical assumptions often conflict,...

1 min 1 month, 1 week ago

standing

LOW Academic International

MC-Search: Evaluating and Enhancing Multimodal Agentic Search with Structured Long Reasoning Chains

arXiv:2603.00873v1 Announce Type: new Abstract: With the increasing demand for step-wise, cross-modal, and knowledge-grounded reasoning, multimodal large language models (MLLMs) are evolving beyond the traditional fixed retrieve-then-generate paradigm toward more sophisticated agentic multimodal retrieval-augmented generation (MM-RAG). Existing benchmarks, however, mainly...

1 min 1 month, 1 week ago

evidence

LOW Academic European Union

BioProAgent: Neuro-Symbolic Grounding for Constrained Scientific Planning

arXiv:2603.00876v1 Announce Type: new Abstract: Large language models (LLMs) have demonstrated significant reasoning capabilities in scientific discovery but struggle to bridge the gap to physical execution in wet-labs. In these irreversible environments, probabilistic hallucinations are not merely incorrect, but also...

1 min 1 month, 1 week ago

discovery

LOW Academic International

HVR-Met: A Hypothesis-Verification-Replaning Agentic System for Extreme Weather Diagnosis

arXiv:2603.01121v1 Announce Type: new Abstract: While deep learning-based weather forecasting paradigms have made significant strides, addressing extreme weather diagnostics remains a formidable challenge. This gap exists primarily because the diagnostic process demands sophisticated multi-step logical reasoning, dynamic tool invocation, and...

1 min 1 month, 1 week ago

evidence

LOW Academic European Union

FCN-LLM: Empower LLM for Brain Functional Connectivity Network Understanding via Graph-level Multi-task Instruction Tuning

arXiv:2603.01135v1 Announce Type: new Abstract: Large Language Models have achieved remarkable success in language understanding and reasoning, and their multimodal extensions enable comprehension of images, video, and audio. Inspired by this, foundation models for brain functional connectivity networks derived from...

1 min 1 month, 1 week ago

standing

LOW Academic United States

TAB-PO: Preference Optimization with a Token-Level Adaptive Barrier for Token-Critical Structured Generation

arXiv:2603.00025v1 Announce Type: new Abstract: Direct Preference Optimization is an offline post-SFT method for aligning language models from preference pairs, with strong results in instruction following and summarization. However, DPO's sequence-level implicit reward can be brittle for token-critical structured prediction...

1 min 1 month, 1 week ago

evidence

LOW Academic International

ActMem: Bridging the Gap Between Memory Retrieval and Reasoning in LLM Agents

arXiv:2603.00026v1 Announce Type: new Abstract: Effective memory management is essential for large language model (LLM) agents handling long-term interactions. Current memory frameworks typically treat agents as passive "recorders" and retrieve information without understanding its deeper implications. They may fail in...

1 min 1 month, 1 week ago

standing

LOW Academic United States

Federated Inference: Toward Privacy-Preserving Collaborative and Incentivized Model Serving

arXiv:2603.02214v1 Announce Type: new Abstract: Federated Inference (FI) studies how independently trained and privately owned models can collaborate at inference time without sharing data or model parameters. While recent work has explored secure and distributed inference from disparate perspectives, a...

1 min 1 month, 1 week ago

standing

LOW Academic International

Engineering Reasoning and Instruction (ERI) Benchmark: A Large Taxonomy-driven Dataset for Foundation Models and Agents

arXiv:2603.02239v1 Announce Type: new Abstract: The Engineering Reasoning and Instruction (ERI) benchmark is a taxonomy-driven instruction dataset designed to train and evaluate engineering-capable large language models (LLMs) and agents. This dataset spans nine engineering fields (namely: civil, mechanical, electrical, chemical,...

1 min 1 month, 1 week ago

trial

LOW Academic International

Estimating Visual Attribute Effects in Advertising from Observational Data: A Deepfake-Informed Double Machine Learning Approach

arXiv:2603.02359v1 Announce Type: new Abstract: Digital advertising increasingly relies on visual content, yet marketers lack rigorous methods for understanding how specific visual attributes causally affect consumer engagement. This paper addresses a fundamental methodological challenge: estimating causal effects when the treatment,...

1 min 1 month, 1 week ago

standing

LOW Academic International

AgentAssay: Token-Efficient Regression Testing for Non-Deterministic AI Agent Workflows

arXiv:2603.02601v1 Announce Type: new Abstract: Autonomous AI agents are deployed at unprecedented scale, yet no principled methodology exists for verifying that an agent has not regressed after changes to its prompts, tools, models, or orchestration logic. We present AgentAssay, the...

1 min 1 month, 1 week ago

trial

LOW Academic International

SorryDB: Can AI Provers Complete Real-World Lean Theorems?

arXiv:2603.02668v1 Announce Type: new Abstract: We present SorryDB, a dynamically-updating benchmark of open Lean tasks drawn from 78 real world formalization projects on GitHub. Unlike existing static benchmarks, often composed of competition problems, hillclimbing the SorryDB benchmark will yield tools...

1 min 1 month, 1 week ago

standing

LOW Academic United States

EvoSkill: Automated Skill Discovery for Multi-Agent Systems

arXiv:2603.02766v1 Announce Type: new Abstract: Coding agents are increasingly used as general-purpose problem solvers, but their flexibility does not by itself confer the domain expertise needed for specialized tasks. Recent work addresses this through \textit{agent skills}: reusable workflows, and code,...

1 min 1 month, 1 week ago

discovery

Rethinking Retrieval-Augmented Generation as a Cooperative Decision-Making Problem

BURMESE-SAN: Burmese NLP Benchmark for Evaluating Large Language Models

SleepLM: Natural-Language Intelligence for Human Sleep

RF-Agent: Automated Reward Function Design via Language Agent Tree Search

Artificial Agency Program: Curiosity, compression, and communication in agents

Keyword search is all you need: Achieving RAG-Level Performance without vector databases using agentic tool use

Toward General Semantic Chunking: A Discriminative Framework for Ultra-Long Documents

Domain-Partitioned Hybrid RAG for Legal Reasoning: Toward Modular and Explainable Legal AI for India

CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era

Multi-Sourced, Multi-Agent Evidence Retrieval for Fact-Checking

DIG to Heal: Scaling General-purpose Agent Collaboration via Explainable Dynamic Decision Paths

Conservative Equilibrium Discovery in Offline Game-Theoretic Multiagent Reinforcement Learning

Confusion-Aware Rubric Optimization for LLM-based Automated Grading

MED-COPILOT: A Medical Assistant Powered by GraphRAG and Similar Patient Case Retrieval

From Goals to Aspects, Revisited: An NFR Pattern Language for Agentic AI Systems

LifeEval: A Multimodal Benchmark for Assistive AI in Egocentric Daily Life Tasks

MicroVerse: A Preliminary Exploration Toward a Micro-World Simulation

Fair in Mind, Fair in Action? A Synchronous Benchmark for Understanding and Generation in UMLLMs

MC-Search: Evaluating and Enhancing Multimodal Agentic Search with Structured Long Reasoning Chains

BioProAgent: Neuro-Symbolic Grounding for Constrained Scientific Planning

HVR-Met: A Hypothesis-Verification-Replaning Agentic System for Extreme Weather Diagnosis

FCN-LLM: Empower LLM for Brain Functional Connectivity Network Understanding via Graph-level Multi-task Instruction Tuning

TAB-PO: Preference Optimization with a Token-Level Adaptive Barrier for Token-Critical Structured Generation

ActMem: Bridging the Gap Between Memory Retrieval and Reasoning in LLM Agents

Federated Inference: Toward Privacy-Preserving Collaborative and Incentivized Model Serving

Engineering Reasoning and Instruction (ERI) Benchmark: A Large Taxonomy-driven Dataset for Foundation Models and Agents

Estimating Visual Attribute Effects in Advertising from Observational Data: A Deepfake-Informed Double Machine Learning Approach

AgentAssay: Token-Efficient Regression Testing for Non-Deterministic AI Agent Workflows

SorryDB: Can AI Provers Complete Real-World Lean Theorems?

EvoSkill: Automated Skill Discovery for Multi-Agent Systems

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.