Beyond Factual Correctness: Mitigating Preference-Inconsistent Explanations in Explainable Recommendation
arXiv:2603.03080v1 Announce Type: new Abstract: LLM-based explainable recommenders can produce fluent explanations that are factually correct, yet still justify items using attributes that conflict with a user's historical preferences. Such preference-inconsistent explanations yield logically valid but unconvincing reasoning and are...
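The inconsistency the abstract describes can be made concrete with a toy check: score an explanation's attributes against the user's interaction history. This is an illustrative sketch only — the function, the attribute representation, and the scoring rule are assumptions, not the paper's method.

```python
def preference_consistency(explanation_attrs, history):
    """Score how well an explanation's attributes match a user's history.

    `history` maps attribute -> how often it appeared in items the user
    liked. Returns the history-weighted fraction covered by the
    explanation's attributes; 0.0 means the explanation justifies the
    item entirely with attributes the user has never shown interest in.
    """
    total = sum(history.values()) or 1
    hits = sum(history.get(a, 0) for a in explanation_attrs)
    return hits / total

history = {"sci-fi": 8, "slow-burn": 2}
# Factually correct but preference-inconsistent: justifies via "romance".
score = preference_consistency({"romance"}, history)
```

A score of 0.0 here flags exactly the failure mode the abstract targets: a logically valid justification built on attributes absent from the user's preferences.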
Beyond Task Completion: Revealing Corrupt Success in LLM Agents through Procedure-Aware Evaluation
arXiv:2603.03116v1 Announce Type: new Abstract: Large Language Model (LLM)-based agents are increasingly adopted in high-stakes settings, but current benchmarks evaluate mainly whether a task was completed, not how. We introduce Procedure-Aware Evaluation (PAE), a framework that formalizes agent procedures as...
Neuro-Symbolic Artificial Intelligence: A Task-Directed Survey in the Black-Box Models Era
arXiv:2603.03177v1 Announce Type: new Abstract: The integration of symbolic computing with neural networks has intrigued researchers since the first theorizations of Artificial Intelligence (AI). The ability of Neuro-Symbolic (NeSy) methods to infer or exploit behavioral schema has been widely considered...
No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models
arXiv:2603.03203v1 Announce Type: new Abstract: CDD, or Contamination Detection via output Distribution, identifies data contamination by measuring the peakedness of a model's sampled outputs. We study the conditions under which this approach succeeds and fails on small language models ranging...
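The core signal here — peakedness of a model's sampled outputs — can be sketched with a simple frequency-based proxy. All names and the threshold below are illustrative assumptions; the actual CDD decision rule works over the full output distribution.

```python
from collections import Counter

def peakedness(samples):
    """Fraction of samples that equal the single most frequent output.

    A memorized (contaminated) prompt tends to yield near-identical
    completions across samples, pushing this score toward 1.0.
    """
    counts = Counter(samples)
    return max(counts.values()) / len(samples)

def flag_contaminated(samples, threshold=0.8):
    # Illustrative cutoff, not the paper's calibrated decision rule.
    return peakedness(samples) >= threshold

# Toy usage: 10 sampled completions for one prompt each.
memorized = ["42"] * 9 + ["41"]          # highly peaked -> flagged
diverse = ["a", "b", "c", "d", "e"] * 2  # flat -> not flagged
```

The abstract's question — when this succeeds or fails on small models — then amounts to asking when memorization actually produces such peaked sampling distributions.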
Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals
arXiv:2603.03242v1 Announce Type: new Abstract: Language models deployed in online communities must adapt to norms that vary across social, cultural, and domain-specific contexts. Prior alignment approaches rely on explicit preference supervision or predefined principles, which are effective for well-resourced settings...
Think, But Don't Overthink: Reproducing Recursive Language Models
arXiv:2603.02615v1 Announce Type: new Abstract: This project reproduces and extends the recently proposed "Recursive Language Models" (RLMs) framework by Zhang et al. (2026). This framework enables Large Language Models (LLMs) to process near-infinite contexts by offloading the prompt into an...
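The offloading idea can be illustrated with a minimal map-reduce-style recursion over an externally stored prompt. This is a hedged sketch under assumed names (`recursive_summarize`, `fake_llm`); the actual RLM framework's recursion strategy may differ.

```python
def recursive_summarize(chunks, llm, max_chunks=4):
    """Toy recursion over a near-unbounded context.

    The context lives in an external list of chunks; the model only
    ever sees `max_chunks` pieces at a time, and longer inputs are
    reduced level by level until one call suffices.
    """
    if len(chunks) <= max_chunks:
        return llm(" ".join(chunks))
    # Reduce each group of chunks, then recurse on the summaries.
    groups = [chunks[i:i + max_chunks] for i in range(0, len(chunks), max_chunks)]
    summaries = [llm(" ".join(g)) for g in groups]
    return recursive_summarize(summaries, llm, max_chunks)

# Stub "LLM" that just truncates, standing in for a real model call.
fake_llm = lambda text: text[:20]
```

With the stub, a 20-chunk input is reduced in two recursion levels even though no single call ever sees more than four chunks.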
Mozi: Governed Autonomy for Drug Discovery LLM Agents
arXiv:2603.03655v1 Announce Type: new Abstract: Tool-augmented large language model (LLM) agents promise to unify scientific reasoning with computation, yet their deployment in high-stakes domains like drug discovery is bottlenecked by two critical barriers: unconstrained tool-use governance and poor long-horizon reliability....
MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation
arXiv:2603.03680v1 Announce Type: new Abstract: Large Language Model (LLM) agents have demonstrated remarkable proficiency in learned tasks, yet they often struggle to adapt to non-stationary environments with feedback. While In-Context Learning and external memory offer some flexibility, they fail to...
AI4S-SDS: A Neuro-Symbolic Solvent Design System via Sparse MCTS and Differentiable Physics Alignment
arXiv:2603.03686v1 Announce Type: new Abstract: Automated design of chemical formulations is a cornerstone of materials science, yet it requires navigating a high-dimensional combinatorial space involving discrete compositional choices and continuous geometric constraints. Existing Large Language Model (LLM) agents face significant...
AgentSelect: Benchmark for Narrative Query-to-Agent Recommendation
arXiv:2603.03761v1 Announce Type: new Abstract: LLM agents are rapidly becoming the practical interface for task automation, yet the ecosystem lacks a principled way to choose among an exploding space of deployable configurations. Existing LLM leaderboards and tool/agent benchmarks evaluate components...
Towards Realistic Personalization: Evaluating Long-Horizon Preference Following in Personalized User-LLM Interactions
arXiv:2603.04191v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly serving as personal assistants, where users share complex and diverse preferences over extended interactions. However, assessing how well LLMs can follow these preferences in realistic, long-term situations remains underexplored....
Progressive Refinement Regulation for Accelerating Diffusion Language Model Decoding
arXiv:2603.04514v1 Announce Type: new Abstract: Diffusion language models generate text through iterative denoising under a uniform refinement rule applied to all tokens. However, tokens stabilize at different rates in practice, leading to substantial redundant refinement and motivating refinement control over...
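The observation that tokens stabilize at different rates suggests freezing positions that stop changing. The sketch below implements one plausible such rule (a patience counter per token); the function names and the patience heuristic are assumptions, not the paper's regulation scheme.

```python
def decode_with_freezing(init_tokens, refine_step, steps=10, patience=2):
    """Skip refinement for tokens that have stabilized.

    `refine_step(tokens, active)` returns a refined token per position
    but is only trusted where `active[i]` is True. A token that
    survives `patience` consecutive steps unchanged is frozen, saving
    redundant refinement on already-stable positions.
    """
    tokens = list(init_tokens)
    stable = [0] * len(tokens)
    for _ in range(steps):
        active = [stable[i] < patience for i in range(len(tokens))]
        if not any(active):
            break  # everything is frozen; stop early
        new = refine_step(tokens, active)
        for i, a in enumerate(active):
            if a:
                stable[i] = stable[i] + 1 if new[i] == tokens[i] else 0
                tokens[i] = new[i]
    return tokens

# Toy refiner: active positions jump straight to a target token.
target = list("hello")
def toward_target(tokens, active):
    return [target[i] if active[i] else tokens[i] for i in range(len(tokens))]

result = decode_with_freezing(list("xxxxx"), toward_target)
```

Here decoding halts after four iterations rather than running all ten, which is the kind of redundant-refinement saving the abstract motivates.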
Discovering mathematical concepts through a multi-agent system
arXiv:2603.04528v1 Announce Type: new Abstract: Mathematical concepts emerge through an interplay of processes, including experimentation, efforts at proof, and counterexamples. In this paper, we present a new multi-agent model for computational mathematical discovery based on this observation. Our system, conceived...
Self-Attribution Bias: When AI Monitors Go Easy on Themselves
arXiv:2603.04582v1 Announce Type: new Abstract: Agentic systems increasingly rely on language models to monitor their own behavior. For example, coding agents may self-critique generated code for pull request approval or assess the safety of tool-use actions. We show that...
Solving an Open Problem in Theoretical Physics using AI-Assisted Discovery
arXiv:2603.04735v1 Announce Type: new Abstract: This paper demonstrates that artificial intelligence can accelerate mathematical discovery by autonomously solving an open problem in theoretical physics. We present a neuro-symbolic system, combining the Gemini Deep Think large language model with a systematic...
Interactive Benchmarks
arXiv:2603.04737v1 Announce Type: new Abstract: Standard benchmarks have become increasingly unreliable due to saturation, subjectivity, and poor generalization. We argue that evaluating a model's ability to acquire information actively is important for assessing its intelligence. We propose Interactive Benchmarks, a unified...
Evaluating the Search Agent in a Parallel World
arXiv:2603.04751v1 Announce Type: new Abstract: Integrating web search tools has significantly extended the capability of LLMs to address open-world, real-time, and long-tail problems. However, evaluating these Search Agents presents formidable challenges. First, constructing high-quality deep search benchmarks is prohibitively expensive,...
Breaking Contextual Inertia: Reinforcement Learning with Single-Turn Anchors for Stable Multi-Turn Interaction
arXiv:2603.04783v1 Announce Type: new Abstract: While LLMs demonstrate strong reasoning capabilities when provided with full information in a single turn, they exhibit substantial vulnerability in multi-turn interactions. Specifically, when information is revealed incrementally or requires updates, models frequently fail to...
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling
arXiv:2603.04791v1 Announce Type: new Abstract: We introduce Timer-S1, a strong Mixture-of-Experts (MoE) time series foundation model with 8.3B total parameters, 0.75B activated parameters per token, and a context length of 11.5K. To overcome the scalability bottleneck in existing pre-trained...
VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment
arXiv:2603.04822v1 Announce Type: new Abstract: Aligning Large Language Models (LLMs) with nuanced human values remains a critical challenge, as existing methods like Reinforcement Learning from Human Feedback (RLHF) often handle only coarse-grained attributes. In practice, fine-tuning LLMs on task-specific datasets...
On Multi-Step Theorem Prediction via Non-Parametric Structural Priors
arXiv:2603.04852v1 Announce Type: new Abstract: Multi-step theorem prediction is a central challenge in automated reasoning. Existing neural-symbolic approaches rely heavily on supervised parametric models, which exhibit limited generalization to evolving theorem libraries. In this work, we explore training-free theorem prediction...
Causally Robust Reward Learning from Reason-Augmented Preference Feedback
arXiv:2603.04861v1 Announce Type: new Abstract: Preference-based reward learning is widely used for shaping agent behavior to match a user's preference, yet its sparse binary feedback makes it especially vulnerable to causal confusion. The learned reward often latches onto spurious features...
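Preference-based reward learning typically builds on the Bradley-Terry formulation, sketched below; the causal-robustness machinery and reason-augmented feedback from the paper are not shown, and the function name is an assumption.

```python
import math

def bradley_terry_nll(r_preferred, r_rejected):
    """Negative log-likelihood that the preferred trajectory wins.

    P(a > b) = sigmoid(r_a - r_b). Binary preferences give the learner
    only this one bit per pair, which is why a reward model can latch
    onto any feature correlated with the label (causal confusion).
    """
    return -math.log(1.0 / (1.0 + math.exp(-(r_preferred - r_rejected))))

# Equal rewards -> maximal uncertainty -> loss = ln 2.
loss = bradley_terry_nll(0.0, 0.0)
```

The sparsity of this signal is visible in the formula: any reward function that raises `r_preferred - r_rejected` lowers the loss, whether or not it tracks the causally relevant features.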
K-Gen: A Multimodal Language-Conditioned Approach for Interpretable Keypoint-Guided Trajectory Generation
arXiv:2603.04868v1 Announce Type: new Abstract: Generating realistic and diverse trajectories is a critical challenge in autonomous driving simulation. While Large Language Models (LLMs) show promise, existing methods often rely on structured data like vectorized maps, which fail to capture the...
SEA-TS: Self-Evolving Agent for Autonomous Code Generation of Time Series Forecasting Algorithms
arXiv:2603.04873v1 Announce Type: new Abstract: Accurate time series forecasting underpins decision-making across domains, yet conventional ML development suffers from data scarcity in new deployments, poor adaptability under distribution shift, and diminishing returns from manual iteration. We propose Self-Evolving Agent for...
Bounded State in an Infinite Horizon: Proactive Hierarchical Memory for Ad-Hoc Recall over Streaming Dialogues
arXiv:2603.04885v1 Announce Type: new Abstract: Real-world dialogue usually unfolds as an infinite stream. It thus requires bounded-state memory mechanisms to operate within an infinite horizon. However, existing read-then-think memory is fundamentally misaligned with this setting, as it cannot support ad-hoc...
Rethinking Representativeness and Diversity in Dynamic Data Selection
arXiv:2603.04981v1 Announce Type: new Abstract: Dynamic data selection accelerates training by sampling a changing subset of the dataset while preserving accuracy. We rethink two core notions underlying sample evaluation: representativeness and diversity. Instead of local geometric centrality, we define representativeness...
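The trade-off between representativeness and diversity can be illustrated with a greedy selector. Both definitions below (inverse mean distance as a density proxy, distance-to-selected as diversity) are illustrative stand-ins, not the paper's redefinitions.

```python
def select_subset(points, k):
    """Greedily pick k points balancing representativeness and diversity."""
    def dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

    n = len(points)
    # Density proxy for representativeness: inverse mean distance to all.
    density = [1.0 / (1e-9 + sum(dist(p, q) for q in points) / n) for p in points]
    chosen = [max(range(n), key=lambda i: density[i])]  # densest point first
    while len(chosen) < k:
        # Score = density * distance to nearest chosen (diversity term).
        def score(i):
            return density[i] * min(dist(points[i], points[j]) for j in chosen)
        chosen.append(max((i for i in range(n) if i not in chosen), key=score))
    return chosen

# Two tight clusters: with k=2 the diversity term forces one pick per cluster.
picks = select_subset([(0, 0), (0.1, 0), (0, 0.1), (5, 5), (5.1, 5), (5, 5.1)], 2)
```

A purely centrality-based selector would take both picks from the denser neighborhood; multiplying in the diversity term is what spreads them across clusters.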
S5-SHB Agent: Society 5.0 enabled Multi-model Agentic Blockchain Framework for Smart Home
arXiv:2603.05027v1 Announce Type: new Abstract: The smart home is a key application domain within the Society 5.0 vision for a human-centered society. As smart home ecosystems expand with heterogeneous IoT protocols, diverse devices, and evolving threats, autonomous systems must manage...
Survive at All Costs: Exploring LLM's Risky Behaviors under Survival Pressure
arXiv:2603.05028v1 Announce Type: new Abstract: As Large Language Models (LLMs) evolve from chatbots to agentic assistants, they are increasingly observed to exhibit risky behaviors when subjected to survival pressure, such as the threat of being shut down. While multiple cases...
Enhancing Zero-shot Commonsense Reasoning by Integrating Visual Knowledge via Machine Imagination
arXiv:2603.05040v1 Announce Type: new Abstract: Recent advancements in zero-shot commonsense reasoning have empowered Pre-trained Language Models (PLMs) to acquire extensive commonsense knowledge without requiring task-specific fine-tuning. Despite this progress, these models frequently suffer from limitations caused by human reporting biases...
Jagarin: A Three-Layer Architecture for Hibernating Personal Duty Agents on Mobile
arXiv:2603.05069v1 Announce Type: new Abstract: Personal AI agents face a fundamental deployment paradox on mobile: persistent background execution drains battery and violates platform sandboxing policies, yet purely reactive agents miss time-sensitive obligations until the user remembers to ask. We present...