Litigation

LOW Academic European Union

InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

arXiv:2602.18985v1 Announce Type: new Abstract: Infrared radiation computing underpins advances in climate science, remote sensing and spectroscopy but remains constrained by manual workflows. We introduce InfEngine, an autonomous intelligent computational engine designed to drive a paradigm shift from human-led orchestration...

1 min 1 month, 1 week ago

discovery

LOW Academic International

DoAtlas-1: A Causal Compilation Paradigm for Clinical AI

arXiv:2602.19158v1 Announce Type: new Abstract: Medical foundation models generate narrative explanations but cannot quantify intervention effects, detect evidence conflicts, or validate literature claims, limiting clinical auditability. We propose causal compilation, a paradigm that transforms medical evidence from narrative text into...

1 min 1 month, 1 week ago

evidence

LOW Academic United States

Beyond Behavioural Trade-Offs: Mechanistic Tracing of Pain-Pleasure Decisions in an LLM

arXiv:2602.19159v1 Announce Type: new Abstract: Prior behavioural work suggests that some LLMs alter choices when options are framed as causing pain or pleasure, and that such deviations can scale with stated intensity. To bridge behavioural evidence (what the model does)...

1 min 1 month, 1 week ago

evidence

LOW Academic International

Proximity-Based Multi-Turn Optimization: Practical Credit Assignment for LLM Agent Training

arXiv:2602.19225v1 Announce Type: new Abstract: Multi-turn LLM agents are becoming pivotal to production systems, spanning customer service automation, e-commerce assistance, and interactive task management, where accurately distinguishing high-value informative signals from stochastic noise is critical for sample-efficient training. In real-world...

1 min 1 month, 1 week ago

trial

LOW Academic European Union

ALPACA: A Reinforcement Learning Environment for Medication Repurposing and Treatment Optimization in Alzheimer's Disease

arXiv:2602.19298v1 Announce Type: new Abstract: Evaluating personalized, sequential treatment strategies for Alzheimer's disease (AD) using clinical trials is often impractical due to long disease horizons and substantial inter-patient heterogeneity. To address these constraints, we present the Alzheimer's Learning Platform for...

1 min 1 month, 1 week ago

trial

LOW Academic United States

Artificial Intelligence for Modeling & Simulation in Digital Twins

arXiv:2602.19390v1 Announce Type: new Abstract: The convergence of modeling & simulation (M&S) and artificial intelligence (AI) is leaving its marks on advanced digital technology. Pertinent examples are digital twins (DTs) - high-fidelity, live representations of physical assets, and frequent enablers...

1 min 1 month, 1 week ago

standing

LOW Academic International

ReportLogic: Evaluating Logical Quality in Deep Research Reports

arXiv:2602.18446v1 Announce Type: new Abstract: Users increasingly rely on Large Language Models (LLMs) for Deep Research, using them to synthesize diverse sources into structured reports that support understanding and action. In this context, the practical reliability of such reports hinges...

1 min 1 month, 1 week ago

standing

LOW Academic International

ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification

arXiv:2602.18447v1 Announce Type: new Abstract: Chain-of-Thought reasoning significantly improves the performance of large language models on complex tasks, but incurs high inference latency due to long generation traces. Step-level speculative reasoning aims to mitigate this cost, yet existing approaches face...

1 min 1 month, 1 week ago

standing

LOW Academic United States

INSURE-Dial: A Phase-Aware Conversational Dataset \& Benchmark for Compliance Verification and Phase Detection

arXiv:2602.18448v1 Announce Type: new Abstract: Administrative phone tasks drain roughly 1 trillion USD annually from U.S. healthcare, with over 500 million insurance-benefit verification calls manually handled in 2024. We introduce INSURE-Dial, to our knowledge the first public benchmark for developing...

1 min 1 month, 1 week ago

evidence

LOW Academic International

The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder

arXiv:2602.18487v1 Announce Type: new Abstract: This paper introduces GLiNER-bi-Encoder, a novel architecture for Named Entity Recognition (NER) that harmonizes zero-shot flexibility with industrial-scale efficiency. While the original GLiNER framework offers strong generalization, its joint-encoding approach suffers from quadratic complexity as...

1 min 1 month, 1 week ago

trial

LOW Academic International

Contradiction to Consensus: Dual Perspective, Multi Source Retrieval Based Claim Verification with Source Level Disagreement using LLM

arXiv:2602.18693v1 Announce Type: new Abstract: The spread of misinformation across digital platforms can pose significant societal risks. Claim verification, a.k.a. fact-checking, systems can help identify potential misinformation. However, their efficacy is limited by the knowledge sources that they rely on....

1 min 1 month, 1 week ago

evidence

LOW Academic International

Rethinking Retrieval-Augmented Generation as a Cooperative Decision-Making Problem

arXiv:2602.18734v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) has demonstrated strong effectiveness in knowledge-intensive tasks by grounding language generation in external evidence. Despite its success, many existing RAG systems are built based on a ranking-centric, asymmetric dependency paradigm, where the...

1 min 1 month, 1 week ago

evidence

LOW Academic International

BURMESE-SAN: Burmese NLP Benchmark for Evaluating Large Language Models

arXiv:2602.18788v1 Announce Type: new Abstract: We introduce BURMESE-SAN, the first holistic benchmark that systematically evaluates large language models (LLMs) for Burmese across three core NLP competencies: understanding (NLU), reasoning (NLR), and generation (NLG). BURMESE-SAN consolidates seven subtasks spanning these competencies,...

1 min 1 month, 1 week ago

standing

LOW Academic International

SleepLM: Natural-Language Intelligence for Human Sleep

arXiv:2602.23605v1 Announce Type: new Abstract: We present SleepLM, a family of sleep-language foundation models that enable human sleep alignment, interpretation, and interaction with natural language. Despite the critical role of sleep, learning-based sleep analysis systems operate in closed label spaces...

1 min 1 month, 1 week ago

standing

LOW Academic International

RF-Agent: Automated Reward Function Design via Language Agent Tree Search

arXiv:2602.23876v1 Announce Type: new Abstract: Designing efficient reward functions for low-level control tasks is a challenging problem. Recent research aims to reduce reliance on expert experience by using Large Language Models (LLMs) with task information to generate dense reward functions....

1 min 1 month, 1 week ago

standing

LOW Academic International

Artificial Agency Program: Curiosity, compression, and communication in agents

arXiv:2602.24100v1 Announce Type: new Abstract: This paper presents the Artificial Agency Program (AAP), a position and research agenda for building AI systems as reality embedded, resource-bounded agents whose development is driven by curiosity-as-learning-progress under physical and computational constraints. The central...

1 min 1 month, 1 week ago

standing

LOW Academic International

Keyword search is all you need: Achieving RAG-Level Performance without vector databases using agentic tool use

arXiv:2602.23368v1 Announce Type: cross Abstract: While Retrieval-Augmented Generation (RAG) has proven effective for generating accurate, context-based responses based on existing knowledge bases, it presents several challenges including retrieval quality dependencies, integration complexity and cost. Recent advances in agentic-RAG and tool-augmented...

1 min 1 month, 1 week ago

standing

LOW Academic International

Toward General Semantic Chunking: A Discriminative Framework for Ultra-Long Documents

arXiv:2602.23370v1 Announce Type: cross Abstract: Long-document topic segmentation plays an important role in information retrieval and document understanding, yet existing methods still show clear shortcomings in ultra-long text settings. Traditional discriminative models are constrained by fixed windows and cannot model...

1 min 1 month, 1 week ago

standing

LOW Academic United States

Domain-Partitioned Hybrid RAG for Legal Reasoning: Toward Modular and Explainable Legal AI for India

arXiv:2602.23371v1 Announce Type: cross Abstract: Legal research in India involves navigating long and heterogeneous documents spanning statutes, constitutional provisions, penal codes, and judicial precedents, where purely keyword-based or embedding-only retrieval systems often fail to support structured legal reasoning. Recent retrieval...

1 min 1 month, 1 week ago

evidence

LOW Academic International

CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era

arXiv:2602.23452v1 Announce Type: new Abstract: Scientific research relies on accurate citation for attribution and integrity, yet large language models (LLMs) introduce a new risk: fabricated references that appear plausible but correspond to no real publications. Such hallucinated citations have already...

1 min 1 month, 1 week ago

evidence

LOW Academic International

Multi-Sourced, Multi-Agent Evidence Retrieval for Fact-Checking

arXiv:2603.00267v1 Announce Type: new Abstract: Misinformation spreading over the Internet poses a significant threat to both societies and individuals, necessitating robust and scalable fact-checking that relies on retrieving accurate and trustworthy evidence. Previous methods rely on semantic and social-contextual patterns...

1 min 1 month, 1 week ago

evidence

LOW Academic European Union

DIG to Heal: Scaling General-purpose Agent Collaboration via Explainable Dynamic Decision Paths

arXiv:2603.00309v1 Announce Type: new Abstract: The increasingly popular agentic AI paradigm promises to harness the power of multiple, general-purpose large language model (LLM) agents to collaboratively complete complex tasks. While many agentic AI systems utilize predefined workflows or agent roles...

1 min 1 month, 1 week ago

standing

LOW Academic International

Conservative Equilibrium Discovery in Offline Game-Theoretic Multiagent Reinforcement Learning

arXiv:2603.00374v1 Announce Type: new Abstract: Offline learning of strategies takes data efficiency to its extreme by restricting algorithms to a fixed dataset of state-action trajectories. We consider the problem in a mixed-motive multiagent setting, where the goal is to solve...

1 min 1 month, 1 week ago

discovery

LOW Academic United States

Confusion-Aware Rubric Optimization for LLM-based Automated Grading

arXiv:2603.00451v1 Announce Type: new Abstract: Accurate and unambiguous guidelines are critical for large language model (LLM) based graders, yet manually crafting these prompts is often sub-optimal as LLMs can misinterpret expert guidelines or lack necessary domain specificity. Consequently, the field...

1 min 1 month, 1 week ago

trial

LOW Academic International

MED-COPILOT: A Medical Assistant Powered by GraphRAG and Similar Patient Case Retrieval

arXiv:2603.00460v1 Announce Type: new Abstract: Clinical decision-making requires synthesizing heterogeneous evidence, including patient histories, clinical guidelines, and trajectories of comparable cases. While large language models (LLMs) offer strong reasoning capabilities, they remain prone to hallucinations and struggle to integrate long,...

1 min 1 month, 1 week ago

evidence

LOW Academic United States

From Goals to Aspects, Revisited: An NFR Pattern Language for Agentic AI Systems

arXiv:2603.00472v1 Announce Type: new Abstract: Agentic AI systems exhibit numerous crosscutting concerns -- security, observability, cost management, fault tolerance -- that are poorly modularized in current implementations, contributing to the high failure rate of AI projects in reaching production. The...

1 min 1 month, 1 week ago

discovery

LOW Academic International

LifeEval: A Multimodal Benchmark for Assistive AI in Egocentric Daily Life Tasks

arXiv:2603.00490v1 Announce Type: new Abstract: The rapid progress of Multimodal Large Language Models (MLLMs) marks a significant step toward artificial general intelligence, offering great potential for augmenting human capabilities. However, their ability to provide effective assistance in dynamic, real-world environments...

1 min 1 month, 1 week ago

standing

LOW Academic International

MicroVerse: A Preliminary Exploration Toward a Micro-World Simulation

arXiv:2603.00585v1 Announce Type: new Abstract: Recent advances in video generation have opened new avenues for macroscopic simulation of complex dynamic systems, but their application to microscopic phenomena remains largely unexplored. Microscale simulation holds great promise for biomedical applications such as...

1 min 1 month, 1 week ago

discovery

LOW Academic International

Fair in Mind, Fair in Action? A Synchronous Benchmark for Understanding and Generation in UMLLMs

arXiv:2603.00590v1 Announce Type: new Abstract: As artificial intelligence (AI) is increasingly deployed across domains, ensuring fairness has become a core challenge. However, the field faces a "Tower of Babel'' dilemma: fairness metrics abound, yet their underlying philosophical assumptions often conflict,...

1 min 1 month, 1 week ago

standing

LOW Academic International

MC-Search: Evaluating and Enhancing Multimodal Agentic Search with Structured Long Reasoning Chains

arXiv:2603.00873v1 Announce Type: new Abstract: With the increasing demand for step-wise, cross-modal, and knowledge-grounded reasoning, multimodal large language models (MLLMs) are evolving beyond the traditional fixed retrieve-then-generate paradigm toward more sophisticated agentic multimodal retrieval-augmented generation (MM-RAG). Existing benchmarks, however, mainly...

1 min 1 month, 1 week ago

evidence

InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

DoAtlas-1: A Causal Compilation Paradigm for Clinical AI

Beyond Behavioural Trade-Offs: Mechanistic Tracing of Pain-Pleasure Decisions in an LLM

Proximity-Based Multi-Turn Optimization: Practical Credit Assignment for LLM Agent Training

ALPACA: A Reinforcement Learning Environment for Medication Repurposing and Treatment Optimization in Alzheimer's Disease

Artificial Intelligence for Modeling & Simulation in Digital Twins

ReportLogic: Evaluating Logical Quality in Deep Research Reports

ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification

INSURE-Dial: A Phase-Aware Conversational Dataset \& Benchmark for Compliance Verification and Phase Detection

The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder

Contradiction to Consensus: Dual Perspective, Multi Source Retrieval Based Claim Verification with Source Level Disagreement using LLM

Rethinking Retrieval-Augmented Generation as a Cooperative Decision-Making Problem

BURMESE-SAN: Burmese NLP Benchmark for Evaluating Large Language Models

SleepLM: Natural-Language Intelligence for Human Sleep

RF-Agent: Automated Reward Function Design via Language Agent Tree Search

Artificial Agency Program: Curiosity, compression, and communication in agents

Keyword search is all you need: Achieving RAG-Level Performance without vector databases using agentic tool use

Toward General Semantic Chunking: A Discriminative Framework for Ultra-Long Documents

Domain-Partitioned Hybrid RAG for Legal Reasoning: Toward Modular and Explainable Legal AI for India

CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era

Multi-Sourced, Multi-Agent Evidence Retrieval for Fact-Checking

DIG to Heal: Scaling General-purpose Agent Collaboration via Explainable Dynamic Decision Paths

Conservative Equilibrium Discovery in Offline Game-Theoretic Multiagent Reinforcement Learning

Confusion-Aware Rubric Optimization for LLM-based Automated Grading

MED-COPILOT: A Medical Assistant Powered by GraphRAG and Similar Patient Case Retrieval

From Goals to Aspects, Revisited: An NFR Pattern Language for Agentic AI Systems

LifeEval: A Multimodal Benchmark for Assistive AI in Egocentric Daily Life Tasks

MicroVerse: A Preliminary Exploration Toward a Micro-World Simulation

Fair in Mind, Fair in Action? A Synchronous Benchmark for Understanding and Generation in UMLLMs

MC-Search: Evaluating and Enhancing Multimodal Agentic Search with Structured Long Reasoning Chains

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.