Immigration Law

LOW Academic International

The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder

arXiv:2602.18487v1 Announce Type: new Abstract: This paper introduces GLiNER-bi-Encoder, a novel architecture for Named Entity Recognition (NER) that harmonizes zero-shot flexibility with industrial-scale efficiency. While the original GLiNER framework offers strong generalization, its joint-encoding approach suffers from quadratic complexity as...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Luna-2: Scalable Single-Token Evaluation with Small Language Models

arXiv:2602.18583v1 Announce Type: new Abstract: Real-time guardrails require evaluation that is accurate, cheap, and fast - yet today's default, LLM-as-a-judge (LLMAJ), is slow, expensive, and operationally non-deterministic due to multi-token generation. We present Luna-2, a novel architecture that leverages decoder-only...

1 min 1 month, 2 weeks ago

ead

LOW Academic United Kingdom

From Trial by Fire To Sleep Like a Baby: A Lexicon of Anxiety Associations for 20k English Multiword Expressions

arXiv:2602.18692v1 Announce Type: new Abstract: Anxiety is the unease about a possible future negative outcome. In recent years, there has been growing interest in understanding how anxiety relates to our health, well-being, body, mind, and behaviour. This includes work on...

1 min 1 month, 2 weeks ago

tps

LOW Academic International

Contradiction to Consensus: Dual Perspective, Multi Source Retrieval Based Claim Verification with Source Level Disagreement using LLM

arXiv:2602.18693v1 Announce Type: new Abstract: The spread of misinformation across digital platforms can pose significant societal risks. Claim verification, a.k.a. fact-checking, systems can help identify potential misinformation. However, their efficacy is limited by the knowledge sources that they rely on....

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Rethinking Retrieval-Augmented Generation as a Cooperative Decision-Making Problem

arXiv:2602.18734v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) has demonstrated strong effectiveness in knowledge-intensive tasks by grounding language generation in external evidence. Despite its success, many existing RAG systems are built based on a ranking-centric, asymmetric dependency paradigm, where the...

1 min 1 month, 2 weeks ago

tps

LOW Academic International

ArabicNumBench: Evaluating Arabic Number Reading in Large Language Models

arXiv:2602.18776v1 Announce Type: new Abstract: We present ArabicNumBench, a comprehensive benchmark for evaluating large language models on Arabic number reading tasks across Eastern Arabic-Indic numerals (0-9 in Arabic script) and Western Arabic numerals (0-9). We evaluate 71 models from 10...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Yor-Sarc: A gold-standard dataset for sarcasm detection in a low-resource African language

arXiv:2602.18964v1 Announce Type: new Abstract: Sarcasm detection poses a fundamental challenge in computational semantics, requiring models to resolve disparities between literal and intended meaning. The challenge is amplified in low-resource languages where annotated datasets are scarce or nonexistent. We present...

1 min 1 month, 2 weeks ago

tps

LOW Academic International

HumanMCP: A Human-Like Query Dataset for Evaluating MCP Tool Retrieval Performance

arXiv:2602.23367v1 Announce Type: new Abstract: Model Context Protocol (MCP) servers contain a collection of thousands of open-source standardized tools, linking LLMs to external systems; however, existing datasets and benchmarks lack realistic, human-like user queries, remaining a critical gap in evaluating...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

MMKG-RDS: Reasoning Data Synthesis via Deep Mining of Multimodal Knowledge Graphs

arXiv:2602.23632v1 Announce Type: new Abstract: Synthesizing high-quality training data is crucial for enhancing domain models' reasoning abilities. Existing methods face limitations in long-tail knowledge coverage, effectiveness verification, and interpretability. Knowledge-graph-based approaches still fall short in functionality, granularity, customizability, and evaluation....

1 min 1 month, 2 weeks ago

tps

LOW Academic European Union

PseudoAct: Leveraging Pseudocode Synthesis for Flexible Planning and Action Control in Large Language Model Agents

arXiv:2602.23668v1 Announce Type: new Abstract: Large language model (LLM) agents typically rely on reactive decision-making paradigms such as ReAct, selecting actions conditioned on growing execution histories. While effective for short tasks, these approaches often lead to redundant tool usage, unstable...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

From Flat Logs to Causal Graphs: Hierarchical Failure Attribution for LLM-based Multi-Agent Systems

arXiv:2602.23701v1 Announce Type: new Abstract: LLM-powered Multi-Agent Systems (MAS) have demonstrated remarkable capabilities in complex domains but suffer from inherent fragility and opaque failure mechanisms. Existing failure attribution methods, whether relying on direct prompting, costly replays, or supervised fine-tuning, typically...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

ProductResearch: Training E-Commerce Deep Research Agents via Multi-Agent Synthetic Trajectory Distillation

arXiv:2602.23716v1 Announce Type: new Abstract: Large Language Model (LLM)-based agents show promise for e-commerce conversational shopping, yet existing implementations lack the interaction depth and contextual breadth required for complex product research. Meanwhile, the Deep Research paradigm, despite advancing information synthesis...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Reasoning-Driven Multimodal LLM for Domain Generalization

arXiv:2602.23777v1 Announce Type: new Abstract: This paper addresses the domain generalization (DG) problem in deep learning. While most DG methods focus on enforcing visual feature invariance, we leverage the reasoning capability of multimodal large language models (MLLMs) and explore the...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

RF-Agent: Automated Reward Function Design via Language Agent Tree Search

arXiv:2602.23876v1 Announce Type: new Abstract: Designing efficient reward functions for low-level control tasks is a challenging problem. Recent research aims to reduce reliance on expert experience by using Large Language Models (LLMs) with task information to generate dense reward functions....

1 min 1 month, 2 weeks ago

tps

LOW Academic International

Pessimistic Auxiliary Policy for Offline Reinforcement Learning

arXiv:2602.23974v1 Announce Type: new Abstract: Offline reinforcement learning aims to learn an agent from pre-collected datasets, avoiding unsafe and inefficient real-time interaction. However, inevitable access to out-ofdistribution actions during the learning process introduces approximation errors, causing the error accumulation and...

1 min 1 month, 2 weeks ago

ead

LOW Academic United States

Portfolio Reinforcement Learning with Scenario-Context Rollout

arXiv:2602.24037v1 Announce Type: new Abstract: Market regime shifts induce distribution shifts that can degrade the performance of portfolio rebalancing policies. We propose macro-conditioned scenario-context rollout (SCR) that generates plausible next-day multivariate return scenarios under stress events. However, doing so faces...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Recycling Failures: Salvaging Exploration in RLVR via Fine-Grained Off-Policy Guidance

arXiv:2602.24110v1 Announce Type: new Abstract: Reinforcement Learning from Verifiable Rewards (RLVR) has emerged as a powerful paradigm for enhancing the complex reasoning capabilities of Large Reasoning Models. However, standard outcome-based supervision suffers from a critical limitation that penalizes trajectories that...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

LemmaBench: A Live, Research-Level Benchmark to Evaluate LLM Capabilities in Mathematics

arXiv:2602.24173v1 Announce Type: new Abstract: We present a new approach for benchmarking Large Language Model (LLM) capabilities on research-level mathematics. Existing benchmarks largely rely on static, hand-curated sets of contest or textbook-style problems as proxies for mathematical research. Instead, we...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Reason to Contrast: A Cascaded Multimodal Retrieval Framework

arXiv:2602.23369v1 Announce Type: cross Abstract: Traditional multimodal retrieval systems rely primarily on bi-encoder architectures, where performance is closely tied to embedding dimensionality. Recent work, Think-Then-Embed (TTE), shows that incorporating multimodal reasoning to elicit additional informative tokens before embedding can further...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Toward General Semantic Chunking: A Discriminative Framework for Ultra-Long Documents

arXiv:2602.23370v1 Announce Type: cross Abstract: Long-document topic segmentation plays an important role in information retrieval and document understanding, yet existing methods still show clear shortcomings in ultra-long text settings. Traditional discriminative models are constrained by fixed windows and cannot model...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Hello-Chat: Towards Realistic Social Audio Interactions

arXiv:2602.23387v1 Announce Type: cross Abstract: Recent advancements in Large Audio Language Models (LALMs) have demonstrated exceptional performance in speech recognition and translation. However, existing models often suffer from a disconnect between perception and expression, resulting in a robotic "read-speech" style...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Task-Lens: Cross-Task Utility Based Speech Dataset Profiling for Low-Resource Indian Languages

arXiv:2602.23388v1 Announce Type: cross Abstract: The rising demand for inclusive speech technologies amplifies the need for multilingual datasets for Natural Language Processing (NLP) research. However, limited awareness of existing task-specific resources in low-resource languages hinders research. This challenge is especially...

1 min 1 month, 2 weeks ago

ead

LOW Academic United States

SALIENT: Frequency-Aware Paired Diffusion for Controllable Long-Tail CT Detection

arXiv:2602.23447v1 Announce Type: cross Abstract: Detection of rare lesions in whole-body CT is fundamentally limited by extreme class imbalance and low target-to-volume ratios, producing precision collapse despite high AUROC. Synthetic augmentation with diffusion models offers promise, yet pixel-space diffusion is...

1 min 1 month, 2 weeks ago

ead

LOW Academic United States

Optimization of Edge Directions and Weights for Mixed Guidance Graphs in Lifelong Multi-Agent Path Finding

arXiv:2602.23468v1 Announce Type: cross Abstract: Multi-Agent Path Finding (MAPF) aims to move agents from their start to goal vertices on a graph. Lifelong MAPF (LMAPF) continuously assigns new goals to agents as they complete current ones. To guide agents' movement...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving

arXiv:2602.23499v1 Announce Type: cross Abstract: Collecting a high-quality dataset is a critical task that demands meticulous attention to detail, as overlooking certain aspects can render the entire dataset unusable. Autonomous driving challenges remain a prominent area of research, requiring further...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era

arXiv:2602.23452v1 Announce Type: new Abstract: Scientific research relies on accurate citation for attribution and integrity, yet large language models (LLMs) introduce a new risk: fabricated references that appear plausible but correspond to no real publications. Such hallucinated citations have already...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

FHIRPath-QA: Executable Question Answering over FHIR Electronic Health Records

arXiv:2602.23479v1 Announce Type: new Abstract: Though patients are increasingly granted digital access to their electronic health records (EHRs), existing interfaces may not support precise, trustworthy answers to patient-specific questions. Large language models (LLM) show promise in clinical question answering (QA),...

1 min 1 month, 2 weeks ago

tps

LOW Academic United States

IDP Accelerator: Agentic Document Intelligence from Extraction to Compliance Validation

arXiv:2602.23481v1 Announce Type: new Abstract: Understanding and extracting structured insights from unstructured documents remains a foundational challenge in industrial NLP. While Large Language Models (LLMs) enable zero-shot extraction, traditional pipelines often fail to handle multi-document packets, complex reasoning, and strict...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Multi-Sourced, Multi-Agent Evidence Retrieval for Fact-Checking

arXiv:2603.00267v1 Announce Type: new Abstract: Misinformation spreading over the Internet poses a significant threat to both societies and individuals, necessitating robust and scalable fact-checking that relies on retrieving accurate and trustworthy evidence. Previous methods rely on semantic and social-contextual patterns...

1 min 1 month, 2 weeks ago

ead

LOW Academic United States

EmCoop: A Framework and Benchmark for Embodied Cooperation Among LLM Agents

arXiv:2603.00349v1 Announce Type: new Abstract: Real-world scenarios increasingly require multiple embodied agents to collaborate in dynamic environments under embodied constraints, as many tasks exceed the capabilities of any single agent. Recent advances in large language models (LLMs) enable high-level cognitive...

1 min 1 month, 2 weeks ago

tps

The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder

Luna-2: Scalable Single-Token Evaluation with Small Language Models

From Trial by Fire To Sleep Like a Baby: A Lexicon of Anxiety Associations for 20k English Multiword Expressions

Contradiction to Consensus: Dual Perspective, Multi Source Retrieval Based Claim Verification with Source Level Disagreement using LLM

Rethinking Retrieval-Augmented Generation as a Cooperative Decision-Making Problem

ArabicNumBench: Evaluating Arabic Number Reading in Large Language Models

Yor-Sarc: A gold-standard dataset for sarcasm detection in a low-resource African language

HumanMCP: A Human-Like Query Dataset for Evaluating MCP Tool Retrieval Performance

MMKG-RDS: Reasoning Data Synthesis via Deep Mining of Multimodal Knowledge Graphs

PseudoAct: Leveraging Pseudocode Synthesis for Flexible Planning and Action Control in Large Language Model Agents

From Flat Logs to Causal Graphs: Hierarchical Failure Attribution for LLM-based Multi-Agent Systems

ProductResearch: Training E-Commerce Deep Research Agents via Multi-Agent Synthetic Trajectory Distillation

Reasoning-Driven Multimodal LLM for Domain Generalization

RF-Agent: Automated Reward Function Design via Language Agent Tree Search

Pessimistic Auxiliary Policy for Offline Reinforcement Learning

Portfolio Reinforcement Learning with Scenario-Context Rollout

Recycling Failures: Salvaging Exploration in RLVR via Fine-Grained Off-Policy Guidance

LemmaBench: A Live, Research-Level Benchmark to Evaluate LLM Capabilities in Mathematics

Reason to Contrast: A Cascaded Multimodal Retrieval Framework

Toward General Semantic Chunking: A Discriminative Framework for Ultra-Long Documents

Hello-Chat: Towards Realistic Social Audio Interactions

Task-Lens: Cross-Task Utility Based Speech Dataset Profiling for Low-Resource Indian Languages

SALIENT: Frequency-Aware Paired Diffusion for Controllable Long-Tail CT Detection

Optimization of Edge Directions and Weights for Mixed Guidance Graphs in Lifelong Multi-Agent Path Finding

TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving

CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era

FHIRPath-QA: Executable Question Answering over FHIR Electronic Health Records

IDP Accelerator: Agentic Document Intelligence from Extraction to Compliance Validation

Multi-Sourced, Multi-Agent Evidence Retrieval for Fact-Checking

EmCoop: A Framework and Benchmark for Embodied Cooperation Among LLM Agents

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.