Immigration Law

LOW Academic International

CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era

arXiv:2602.23452v1 Announce Type: new Abstract: Scientific research relies on accurate citation for attribution and integrity, yet large language models (LLMs) introduce a new risk: fabricated references that appear plausible but correspond to no real publications. Such hallucinated citations have already...

1 min 1 month, 1 week ago

ead

LOW Academic International

FHIRPath-QA: Executable Question Answering over FHIR Electronic Health Records

arXiv:2602.23479v1 Announce Type: new Abstract: Though patients are increasingly granted digital access to their electronic health records (EHRs), existing interfaces may not support precise, trustworthy answers to patient-specific questions. Large language models (LLM) show promise in clinical question answering (QA),...

1 min 1 month, 1 week ago

tps

LOW Academic International

Multi-Sourced, Multi-Agent Evidence Retrieval for Fact-Checking

arXiv:2603.00267v1 Announce Type: new Abstract: Misinformation spreading over the Internet poses a significant threat to both societies and individuals, necessitating robust and scalable fact-checking that relies on retrieving accurate and trustworthy evidence. Previous methods rely on semantic and social-contextual patterns...

1 min 1 month, 1 week ago

ead

LOW Academic International

Monotropic Artificial Intelligence: Toward a Cognitive Taxonomy of Domain-Specialized Language Models

arXiv:2603.00350v1 Announce Type: new Abstract: The prevailing paradigm in artificial intelligence research equates progress with scale: larger models trained on broader datasets are presumed to yield superior capabilities. This assumption, while empirically productive for general-purpose applications, obscures a fundamental epistemological...

1 min 1 month, 1 week ago

ead

LOW Academic International

MED-COPILOT: A Medical Assistant Powered by GraphRAG and Similar Patient Case Retrieval

arXiv:2603.00460v1 Announce Type: new Abstract: Clinical decision-making requires synthesizing heterogeneous evidence, including patient histories, clinical guidelines, and trajectories of comparable cases. While large language models (LLMs) offer strong reasoning capabilities, they remain prone to hallucinations and struggle to integrate long,...

1 min 1 month, 1 week ago

tps

LOW Academic International

MicroVerse: A Preliminary Exploration Toward a Micro-World Simulation

arXiv:2603.00585v1 Announce Type: new Abstract: Recent advances in video generation have opened new avenues for macroscopic simulation of complex dynamic systems, but their application to microscopic phenomena remains largely unexplored. Microscale simulation holds great promise for biomedical applications such as...

1 min 1 month, 1 week ago

tps

LOW Academic International

TraceSIR: A Multi-Agent Framework for Structured Analysis and Reporting of Agentic Execution Traces

arXiv:2603.00623v1 Announce Type: new Abstract: Agentic systems augment large language models with external tools and iterative decision making, enabling complex tasks such as deep research, function calling, and coding. However, their long and intricate execution traces make failure diagnosis and...

1 min 1 month, 1 week ago

tps

LOW Academic International

The Synthetic Web: Adversarially-Curated Mini-Internets for Diagnosing Epistemic Weaknesses of Language Agents

arXiv:2603.00801v1 Announce Type: new Abstract: Language agents increasingly act as web-enabled systems that search, browse, and synthesize information from diverse sources. However, these sources can include unreliable or adversarial content, and the robustness of agents to adversarial ranking - where...

1 min 1 month, 1 week ago

ead

LOW Academic International

MC-Search: Evaluating and Enhancing Multimodal Agentic Search with Structured Long Reasoning Chains

arXiv:2603.00873v1 Announce Type: new Abstract: With the increasing demand for step-wise, cross-modal, and knowledge-grounded reasoning, multimodal large language models (MLLMs) are evolving beyond the traditional fixed retrieve-then-generate paradigm toward more sophisticated agentic multimodal retrieval-augmented generation (MM-RAG). Existing benchmarks, however, mainly...

1 min 1 month, 1 week ago

ead

LOW Academic International

HiMAC: Hierarchical Macro-Micro Learning for Long-Horizon LLM Agents

arXiv:2603.00977v1 Announce Type: new Abstract: Large language model (LLM) agents have recently demonstrated strong capabilities in interactive decision-making, yet they remain fundamentally limited in long-horizon tasks that require structured planning and reliable execution. Existing approaches predominantly rely on flat autoregressive...

1 min 1 month, 1 week ago

ead

LOW Academic International

Tracking Capabilities for Safer Agents

arXiv:2603.00991v1 Announce Type: new Abstract: AI agents that interact with the real world through tool calls pose fundamental safety challenges: agents might leak private information, cause unintended side effects, or be manipulated through prompt injection. To address these challenges, we...

1 min 1 month, 1 week ago

ead

LOW Academic International

DIVA-GRPO: Enhancing Multimodal Reasoning through Difficulty-Adaptive Variant Advantage

arXiv:2603.01106v1 Announce Type: new Abstract: Reinforcement learning (RL) with group relative policy optimization (GRPO) has become a widely adopted approach for enhancing the reasoning capabilities of multimodal large language models (MLLMs). While GRPO enables long-chain reasoning without a critic, it...

1 min 1 month, 1 week ago

tps

LOW Academic International

DeepResearch-9K: A Challenging Benchmark Dataset of Deep-Research Agent

arXiv:2603.01152v1 Announce Type: new Abstract: Deep-research agents are capable of executing multi-step web exploration, targeted retrieval, and sophisticated question answering. Despite their powerful capabilities, deep-research agents face two critical bottlenecks: (1) the lack of large-scale, challenging datasets with real-world difficulty,...

1 min 1 month, 1 week ago

tps

LOW Academic International

Incremental LTLf Synthesis

arXiv:2603.01201v1 Announce Type: new Abstract: In this paper, we study incremental LTLf synthesis -- a form of reactive synthesis where the goals are given incrementally while in execution. In other words, the protagonist agent is already executing a strategy for...

1 min 1 month, 1 week ago

ead

LOW Academic International

Noise reduction in BERT NER models for clinical entity extraction

arXiv:2603.00022v1 Announce Type: new Abstract: Precision is of utmost importance in the realm of clinical entity extraction from clinical notes and reports. Encoder Models fine-tuned for Named Entity Recognition (NER) are an efficient choice for this purpose, as they don't...

1 min 1 month, 1 week ago

removal

LOW Academic International

SimpleTool: Parallel Decoding for Real-Time LLM Function Calling

arXiv:2603.00030v1 Announce Type: new Abstract: LLM-based function calling enables intelligent agents to interact with external tools and environments, yet autoregressive decoding imposes a fundamental latency bottleneck that limits real-time applications such as embodied intelligence, game AI, and interactive avatars (e.g.,...

1 min 1 month, 1 week ago

ead

LOW Academic International

When Metrics Disagree: Automatic Similarity vs. LLM-as-a-Judge for Clinical Dialogue Evaluation

arXiv:2603.00314v1 Announce Type: new Abstract: This paper details the baseline model selection, fine-tuning process, evaluation methods, and the implications of deploying more accurate LLMs in healthcare settings. As large language models (LLMs) are increasingly employed to address diverse problems, including...

1 min 1 month, 1 week ago

ead

LOW Academic International

Diagnosing Retrieval vs. Utilization Bottlenecks in LLM Agent Memory

arXiv:2603.02473v1 Announce Type: new Abstract: Memory-augmented LLM agents store and retrieve information from prior interactions, yet the relative importance of how memories are written versus how they are retrieved remains unclear. We introduce a diagnostic framework that analyzes how performance...

1 min 1 month, 1 week ago

tps

LOW Academic International

See and Remember: A Multimodal Agent for Web Traversal

arXiv:2603.02626v1 Announce Type: new Abstract: Autonomous web navigation requires agents to perceive complex visual environments and maintain long-term context, yet current Large Language Model (LLM) based agents often struggle with spatial disorientation and navigation loops. In this paper, we propose...

1 min 1 month, 1 week ago

tps

LOW Academic International

Rethinking Code Similarity for Automated Algorithm Design with LLMs

arXiv:2603.02787v1 Announce Type: new Abstract: The rise of Large Language Model-based Automated Algorithm Design (LLM-AAD) has transformed algorithm development by autonomously generating code implementations of expert-level algorithms. Unlike traditional expert-driven algorithm development, in the LLM-AAD paradigm, the main design principle...

1 min 1 month, 1 week ago

tps

LOW Academic International

SAE as a Crystal Ball: Interpretable Features Predict Cross-domain Transferability of LLMs without Training

arXiv:2603.02908v1 Announce Type: new Abstract: In recent years, pre-trained large language models have achieved remarkable success across diverse tasks. Besides the pivotal role of self-supervised pre-training, their effectiveness in downstream applications also depends critically on the post-training process, which adapts...

1 min 1 month, 1 week ago

tps

LOW Academic International

OrchMAS: Orchestrated Reasoning with Multi Collaborative Heterogeneous Scientific Expert Structured Agents

arXiv:2603.03005v1 Announce Type: new Abstract: Multi-agent large language model frameworks are promising for complex multi step reasoning, yet existing systems remain weak for scientific and knowledge intensive domains due to static prompts and agent roles, rigid workflows, and homogeneous model...

1 min 1 month, 1 week ago

ead

LOW Academic International

Beyond Factual Correctness: Mitigating Preference-Inconsistent Explanations in Explainable Recommendation

arXiv:2603.03080v1 Announce Type: new Abstract: LLM-based explainable recommenders can produce fluent explanations that are factually correct, yet still justify items using attributes that conflict with a user's historical preferences. Such preference-inconsistent explanations yield logically valid but unconvincing reasoning and are...

1 min 1 month, 1 week ago

ead

LOW Academic International

Beyond Task Completion: Revealing Corrupt Success in LLM Agents through Procedure-Aware Evaluation

arXiv:2603.03116v1 Announce Type: new Abstract: Large Language Model (LLM)-based agents are increasingly adopted in high-stakes settings, but current benchmarks evaluate mainly whether a task was completed, not how. We introduce Procedure-Aware Evaluation (PAE), a framework that formalizes agent procedures as...

1 min 1 month, 1 week ago

ead

LOW Academic International

No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models

arXiv:2603.03203v1 Announce Type: new Abstract: CDD, or Contamination Detection via output Distribution, identifies data contamination by measuring the peakedness of a model's sampled outputs. We study the conditions under which this approach succeeds and fails on small language models ranging...

1 min 1 month, 1 week ago

tps

LOW Academic International

Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals

arXiv:2603.03242v1 Announce Type: new Abstract: Language models deployed in online communities must adapt to norms that vary across social, cultural, and domain-specific contexts. Prior alignment approaches rely on explicit preference supervision or predefined principles, which are effective for well-resourced settings...

1 min 1 month, 1 week ago

ead

LOW Academic International

Think, But Don't Overthink: Reproducing Recursive Language Models

arXiv:2603.02615v1 Announce Type: new Abstract: This project reproduces and extends the recently proposed ``Recursive Language Models'' (RLMs) framework by Zhang et al. (2026). This framework enables Large Language Models (LLMs) to process near-infinite contexts by offloading the prompt into an...

1 min 1 month, 1 week ago

tps

LOW Academic International

MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation

arXiv:2603.03680v1 Announce Type: new Abstract: Large Language Model (LLM) agents have demonstrated remarkable proficiency in learned tasks, yet they often struggle to adapt to non-stationary environments with feedback. While In-Context Learning and external memory offer some flexibility, they fail to...

1 min 1 month, 1 week ago

tps

LOW Academic International

Towards Realistic Personalization: Evaluating Long-Horizon Preference Following in Personalized User-LLM Interactions

arXiv:2603.04191v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly serving as personal assistants, where users share complex and diverse preferences over extended interactions. However, assessing how well LLMs can follow these preferences in realistic, long-term situations remains underexplored....

1 min 1 month, 1 week ago

tps

LOW Academic International

Progressive Refinement Regulation for Accelerating Diffusion Language Model Decoding

arXiv:2603.04514v1 Announce Type: new Abstract: Diffusion language models generate text through iterative denoising under a uniform refinement rule applied to all tokens. However, tokens stabilize at different rates in practice, leading to substantial redundant refinement and motivating refinement control over...

1 min 1 month, 1 week ago

ead

CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era

FHIRPath-QA: Executable Question Answering over FHIR Electronic Health Records

Multi-Sourced, Multi-Agent Evidence Retrieval for Fact-Checking

Monotropic Artificial Intelligence: Toward a Cognitive Taxonomy of Domain-Specialized Language Models

MED-COPILOT: A Medical Assistant Powered by GraphRAG and Similar Patient Case Retrieval

MicroVerse: A Preliminary Exploration Toward a Micro-World Simulation

TraceSIR: A Multi-Agent Framework for Structured Analysis and Reporting of Agentic Execution Traces

The Synthetic Web: Adversarially-Curated Mini-Internets for Diagnosing Epistemic Weaknesses of Language Agents

MC-Search: Evaluating and Enhancing Multimodal Agentic Search with Structured Long Reasoning Chains

HiMAC: Hierarchical Macro-Micro Learning for Long-Horizon LLM Agents

Tracking Capabilities for Safer Agents

DIVA-GRPO: Enhancing Multimodal Reasoning through Difficulty-Adaptive Variant Advantage

DeepResearch-9K: A Challenging Benchmark Dataset of Deep-Research Agent

Incremental LTLf Synthesis

Noise reduction in BERT NER models for clinical entity extraction

SimpleTool: Parallel Decoding for Real-Time LLM Function Calling

When Metrics Disagree: Automatic Similarity vs. LLM-as-a-Judge for Clinical Dialogue Evaluation

Diagnosing Retrieval vs. Utilization Bottlenecks in LLM Agent Memory

See and Remember: A Multimodal Agent for Web Traversal

Rethinking Code Similarity for Automated Algorithm Design with LLMs

SAE as a Crystal Ball: Interpretable Features Predict Cross-domain Transferability of LLMs without Training

OrchMAS: Orchestrated Reasoning with Multi Collaborative Heterogeneous Scientific Expert Structured Agents

Beyond Factual Correctness: Mitigating Preference-Inconsistent Explanations in Explainable Recommendation

Beyond Task Completion: Revealing Corrupt Success in LLM Agents through Procedure-Aware Evaluation

No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models

Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals

Think, But Don't Overthink: Reproducing Recursive Language Models

MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation

Towards Realistic Personalization: Evaluating Long-Horizon Preference Following in Personalized User-LLM Interactions

Progressive Refinement Regulation for Accelerating Diffusion Language Model Decoding

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.