Immigration Law

LOW Academic International

Adaptive Vision-Language Model Routing for Computer Use Agents

arXiv:2603.12823v1 Announce Type: new Abstract: Computer Use Agents (CUAs) translate natural-language instructions into Graphical User Interface (GUI) actions such as clicks, keystrokes, and scrolls by relying on a Vision-Language Model (VLM) to interpret screenshots and predict grounded tool calls. However,...

1 min 1 month ago

tps

LOW Academic International

Long-form RewardBench: Evaluating Reward Models for Long-form Generation

arXiv:2603.12963v1 Announce Type: new Abstract: The widespread adoption of reinforcement learning-based alignment highlights the growing importance of reward models. Various benchmarks have been built to evaluate reward models in various domains and scenarios. However, a significant gap remains in assessing...

1 min 1 month ago

ead

LOW Academic International

Multi-Step Semantic Reasoning in Generative Retrieval

arXiv:2603.12368v1 Announce Type: cross Abstract: Generative retrieval (GR) models encode a corpus within model parameters and generate relevant document identifiers directly for a given query. While this paradigm shows promise in retrieval tasks, existing GR models struggle with complex queries...

1 min 1 month ago

ead

LOW Academic International

Speech-Worthy Alignment for Japanese SpeechLLMs via Direct Preference Optimization

arXiv:2603.12565v1 Announce Type: cross Abstract: SpeechLLMs typically combine ASR-trained encoders with text-based LLM backbones, leading them to inherit written-style output patterns unsuitable for text-to-speech synthesis. This mismatch is particularly pronounced in Japanese, where spoken and written registers differ substantially in...

1 min 1 month ago

ead

LOW Academic International

Sinkhorn-Drifting Generative Models

arXiv:2603.12366v1 Announce Type: new Abstract: We establish a theoretical link between the recently proposed "drifting" generative dynamics and gradient flows induced by the Sinkhorn divergence. In a particle discretization, the drift field admits a cross-minus-self decomposition: an attractive term toward...

1 min 1 month ago

tps

LOW Academic International

Probing Length Generalization in Mamba via Image Reconstruction

arXiv:2603.12499v1 Announce Type: new Abstract: Mamba has attracted widespread interest as a general-purpose sequence model due to its low computational complexity and competitive performance relative to transformers. However, its performance can degrade when inference sequence lengths exceed those seen during...

1 min 1 month ago

ead

LOW Academic International

Maximizing Incremental Information Entropy for Contrastive Learning

arXiv:2603.12594v1 Announce Type: new Abstract: Contrastive learning has achieved remarkable success in self-supervised representation learning, often guided by information-theoretic objectives such as mutual information maximization. Motivated by the limitations of static augmentations and rigid invariance constraints, we propose IE-CL (Incremental-Entropy...

1 min 1 month ago

l-1

LOW Academic International

Swap-guided Preference Learning for Personalized Reinforcement Learning from Human Feedback

arXiv:2603.12595v1 Announce Type: new Abstract: Reinforcement Learning from Human Feedback (RLHF) is a widely used approach to align large-scale AI systems with human values. However, RLHF typically assumes a single, universal reward, which overlooks diverse preferences and limits personalization. Variational...

1 min 1 month ago

tps

LOW Academic International

Human-AI Collaborative Autonomous Experimentation With Proxy Modeling for Comparative Observation

arXiv:2603.12618v1 Announce Type: new Abstract: Optimization for different tasks like material characterization, synthesis, and functional properties for desired applications over multi-dimensional control parameters need a rapid strategic search through active learning such as Bayesian optimization (BO). However, such high-dimensional experimental...

1 min 1 month ago

ead

LOW Academic International

LightMoE: Reducing Mixture-of-Experts Redundancy through Expert Replacing

arXiv:2603.12645v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) based Large Language Models (LLMs) have demonstrated impressive performance and computational efficiency. However, their deployment is often constrained by substantial memory demands, primarily due to the need to load numerous expert modules. While...

1 min 1 month ago

ead

LOW Academic International

RetroReasoner: A Reasoning LLM for Strategic Retrosynthesis Prediction

arXiv:2603.12666v1 Announce Type: new Abstract: Retrosynthesis prediction is a core task in organic synthesis that aims to predict reactants for a given product molecule. Traditionally, chemists select a plausible bond disconnection and derive corresponding reactants, which is time-consuming and requires...

1 min 1 month ago

ead

LOW News International

Before quantum computing arrives, this startup wants enterprises already running on it

After selling his AI startup to AMD for $665 million, Peter Sarlin is back with Qutwo, a new venture building the infrastructure it believes enterprises will need when quantum computing finally arrives.

1 min 1 month ago

ead

LOW Academic International

MDER-DR: Multi-Hop Question Answering with Entity-Centric Summaries

arXiv:2603.11223v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) over Knowledge Graphs (KGs) suffers from the fact that indexing approaches may lose important contextual nuance when text is reduced to triples, thereby degrading performance in downstream Question-Answering (QA) tasks, particularly for...

1 min 1 month ago

tps

LOW Academic International

Adversarial Reinforcement Learning for Detecting False Data Injection Attacks in Vehicular Routing

arXiv:2603.11433v1 Announce Type: new Abstract: In modern transportation networks, adversaries can manipulate routing algorithms using false data injection attacks, such as simulating heavy traffic with multiple devices running crowdsourced navigation applications, to mislead vehicles toward suboptimal routes and increase congestion....

1 min 1 month ago

ead

LOW Academic International

PACED: Distillation at the Frontier of Student Competence

arXiv:2603.11178v1 Announce Type: new Abstract: Standard LLM distillation wastes compute on two fronts: problems the student has already mastered (near-zero gradients) and problems far beyond its reach (incoherent gradients that erode existing capabilities). We show that this waste is not...

1 min 1 month ago

ead

LOW Academic International

Summarize Before You Speak with ARACH: A Training-Free Inference-Time Plug-In for Enhancing LLMs via Global Attention Reallocation

arXiv:2603.11067v1 Announce Type: new Abstract: Large language models (LLMs) achieve remarkable performance, yet further gains often require costly training. This has motivated growing interest in post-training techniques-especially training-free approaches that improve models at inference time without updating weights. Most training-free...

1 min 1 month ago

ead

LOW Academic International

The Density of Cross-Persistence Diagrams and Its Applications

arXiv:2603.11623v1 Announce Type: new Abstract: Topological Data Analysis (TDA) provides powerful tools to explore the shape and structure of data through topological features such as clusters, loops, and voids. Persistence diagrams are a cornerstone of TDA, capturing the evolution of...

1 min 1 month ago

tps

LOW Academic International

Algorithmic Consequences of Particle Filters for Sentence Processing: Amplified Garden-Paths and Digging-In Effects

arXiv:2603.11412v1 Announce Type: new Abstract: Under surprisal theory, linguistic representations affect processing difficulty only through the bottleneck of surprisal. Our best estimates of surprisal come from large language models, which have no explicit representation of structural ambiguity. While LLM surprisal...

1 min 1 month ago

ead

LOW Academic International

Gender Bias in Generative AI-assisted Recruitment Processes

arXiv:2603.11736v1 Announce Type: new Abstract: In recent years, generative artificial intelligence (GenAI) systems have assumed increasingly crucial roles in selection processes, personnel recruitment and analysis of candidates' profiles. However, the employment of large language models (LLMs) risks reproducing, and in...

1 min 1 month ago

ead

LOW Academic International

RewardHackingAgents: Benchmarking Evaluation Integrity for LLM ML-Engineering Agents

arXiv:2603.11337v1 Announce Type: new Abstract: LLM agents increasingly perform end-to-end ML engineering tasks where success is judged by a single scalar test metric. This creates a structural vulnerability: an agent can increase the reported score by compromising the evaluation pipeline...

1 min 1 month ago

ead

LOW Academic International

Temporal Text Classification with Large Language Models

arXiv:2603.11295v1 Announce Type: new Abstract: Languages change over time. Computational models can be trained to recognize such changes enabling them to estimate the publication date of texts. Despite recent advancements in Large Language Models (LLMs), their performance on automatic dating...

1 min 1 month ago

ead

LOW Academic International

The Unlearning Mirage: A Dynamic Framework for Evaluating LLM Unlearning

arXiv:2603.11266v1 Announce Type: new Abstract: Unlearning in Large Language Models (LLMs) aims to enhance safety, mitigate biases, and comply with legal mandates, such as the right to be forgotten. However, existing unlearning methods are brittle: minor query modifications, such as...

1 min 1 month ago

tps

LOW Academic International

DocSage: An Information Structuring Agent for Multi-Doc Multi-Entity Question Answering

arXiv:2603.11798v1 Announce Type: new Abstract: Multi-document Multi-entity Question Answering inherently demands models to track implicit logic between multiple entities across scattered documents. However, existing Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) frameworks suffer from critical limitations: standard RAG's vector...

1 min 1 month ago

ead

LOW Academic International

Deactivating Refusal Triggers: Understanding and Mitigating Overrefusal in Safety Alignment

arXiv:2603.11388v1 Announce Type: new Abstract: Safety alignment aims to ensure that large language models (LLMs) refuse harmful requests by post-training on harmful queries paired with refusal answers. Although safety alignment is widely adopted in industry, the overrefusal problem where aligned...

1 min 1 month ago

ead

LOW Academic International

MaterialFigBENCH: benchmark dataset with figures for evaluating college-level materials science problem-solving abilities of multimodal large language models

arXiv:2603.11414v1 Announce Type: new Abstract: We present MaterialFigBench, a benchmark dataset designed to evaluate the ability of multimodal large language models (LLMs) to solve university-level materials science problems that require accurate interpretation of figures. Unlike existing benchmarks that primarily rely...

1 min 1 month ago

ead

LOW Academic International

CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges

arXiv:2603.11863v1 Announce Type: new Abstract: The saturation of high-quality pre-training data has shifted research focus toward evolutionary systems capable of continuously generating novel artifacts, leading to the success of AlphaEvolve. However, the progress of such systems is hindered by the...

1 min 1 month ago

ead

LOW Academic International

Markovian Generation Chains in Large Language Models

arXiv:2603.11228v1 Announce Type: new Abstract: The widespread use of large language models (LLMs) raises an important question: how do texts evolve when they are repeatedly processed by LLMs? In this paper, we define this iterative inference process as Markovian generation...

1 min 1 month ago

ead

LOW Academic International

ThReadMed-QA: A Multi-Turn Medical Dialogue Benchmark from Real Patient Questions

arXiv:2603.11281v1 Announce Type: new Abstract: Medical question-answering benchmarks predominantly evaluate single-turn exchanges, failing to capture the iterative, clarification-seeking nature of real patient consultations. We introduce ThReadMed-QA, a benchmark of 2,437 fully-answered patient-physician conversation threads extracted from r/AskDocs, comprising 8,204 question-answer...

1 min 1 month ago

ead

LOW Academic International

Compression Favors Consistency, Not Truth: When and Why Language Models Prefer Correct Information

arXiv:2603.11749v1 Announce Type: new Abstract: Why do language models sometimes prefer correct statements even when trained on mixed-quality data? We introduce the Compression--Consistency Principle: next-token prediction favors hypotheses that allow shorter and more internally consistent descriptions of the training data....

1 min 1 month ago

tps

LOW Academic International

DatedGPT: Preventing Lookahead Bias in Large Language Models with Time-Aware Pretraining

arXiv:2603.11838v1 Announce Type: new Abstract: In financial backtesting, large language models pretrained on internet-scale data risk introducing lookahead bias that undermines their forecasting validity, as they may have already seen the true outcome during training. To address this, we present...

1 min 1 month ago

ead

Adaptive Vision-Language Model Routing for Computer Use Agents

Long-form RewardBench: Evaluating Reward Models for Long-form Generation

Multi-Step Semantic Reasoning in Generative Retrieval

Speech-Worthy Alignment for Japanese SpeechLLMs via Direct Preference Optimization

Sinkhorn-Drifting Generative Models

Probing Length Generalization in Mamba via Image Reconstruction

Maximizing Incremental Information Entropy for Contrastive Learning

Swap-guided Preference Learning for Personalized Reinforcement Learning from Human Feedback

Human-AI Collaborative Autonomous Experimentation With Proxy Modeling for Comparative Observation

LightMoE: Reducing Mixture-of-Experts Redundancy through Expert Replacing

RetroReasoner: A Reasoning LLM for Strategic Retrosynthesis Prediction

Before quantum computing arrives, this startup wants enterprises already running on it

MDER-DR: Multi-Hop Question Answering with Entity-Centric Summaries

Adversarial Reinforcement Learning for Detecting False Data Injection Attacks in Vehicular Routing

PACED: Distillation at the Frontier of Student Competence

Summarize Before You Speak with ARACH: A Training-Free Inference-Time Plug-In for Enhancing LLMs via Global Attention Reallocation

The Density of Cross-Persistence Diagrams and Its Applications

Algorithmic Consequences of Particle Filters for Sentence Processing: Amplified Garden-Paths and Digging-In Effects

Gender Bias in Generative AI-assisted Recruitment Processes

RewardHackingAgents: Benchmarking Evaluation Integrity for LLM ML-Engineering Agents

Temporal Text Classification with Large Language Models

The Unlearning Mirage: A Dynamic Framework for Evaluating LLM Unlearning

DocSage: An Information Structuring Agent for Multi-Doc Multi-Entity Question Answering

Deactivating Refusal Triggers: Understanding and Mitigating Overrefusal in Safety Alignment

MaterialFigBENCH: benchmark dataset with figures for evaluating college-level materials science problem-solving abilities of multimodal large language models

CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges

Markovian Generation Chains in Large Language Models

ThReadMed-QA: A Multi-Turn Medical Dialogue Benchmark from Real Patient Questions

Compression Favors Consistency, Not Truth: When and Why Language Models Prefer Correct Information

DatedGPT: Preventing Lookahead Bias in Large Language Models with Time-Aware Pretraining

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.