Immigration Law

LOW Academic International

Trade-offs in Ensembling, Merging and Routing Among Parameter-Efficient Experts

arXiv:2603.03535v1 Announce Type: new Abstract: While large language models (LLMs) fine-tuned with lightweight adapters achieve strong performance across diverse tasks, their performance on individual tasks depends on the fine-tuning strategy. Fusing independently trained models with different strengths has shown promise...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Online Learnability of Chain-of-Thought Verifiers: Soundness and Completeness Trade-offs

arXiv:2603.03538v1 Announce Type: new Abstract: Large language models with chain-of-thought generation have demonstrated great potential for producing complex mathematical proofs. However, their reasoning can often go astray, leading to increasing interest in formal and learned verifiers. A major challenge in...

1 min 1 month, 2 weeks ago

ead

LOW Academic United States

Hybrid Belief Reinforcement Learning for Efficient Coordinated Spatial Exploration

arXiv:2603.03595v1 Announce Type: new Abstract: Coordinating multiple autonomous agents to explore and serve spatially heterogeneous demand requires jointly learning unknown spatial patterns and planning trajectories that maximize task performance. Pure model-based approaches provide structured uncertainty estimates but lack adaptive policy...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

NuMuon: Nuclear-Norm-Constrained Muon for Compressible LLM Training

arXiv:2603.03597v1 Announce Type: new Abstract: The rapid progress of large language models (LLMs) is increasingly constrained by memory and deployment costs, motivating compression methods for practical deployment. Many state-of-the-art compression pipelines leverage the low-rank structure of trained weight matrices, a...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Why Are Linear RNNs More Parallelizable?

arXiv:2603.03612v1 Announce Type: new Abstract: The community is increasingly exploring linear RNNs (LRNNs) as language models, motivated by their expressive power and parallelizability. While prior work establishes the expressivity benefits of LRNNs over transformers, it is unclear what makes LRNNs...

1 min 1 month, 2 weeks ago

ead

LOW Academic United States

Freezing of Gait Prediction using Proactive Agent that Learns from Selected Experience and DDQN Algorithm

arXiv:2603.03651v1 Announce Type: new Abstract: Freezing of Gait (FOG) is a debilitating motor symptom commonly experienced by individuals with Parkinson's Disease (PD) which often leads to falls and reduced mobility. Timely and accurate prediction of FOG episodes is essential for...

1 min 1 month, 2 weeks ago

ead

LOW Academic European Union

Graph Negative Feedback Bias Correction Framework for Adaptive Heterophily Modeling

arXiv:2603.03662v1 Announce Type: new Abstract: Graph Neural Networks (GNNs) have emerged as a powerful framework for processing graph-structured data. However, conventional GNNs and their variants are inherently limited by the homophily assumption, leading to degradation in performance on heterophilic graphs....

1 min 1 month, 2 weeks ago

ead

LOW Academic International

A Stein Identity for q-Gaussians with Bounded Support

arXiv:2603.03673v1 Announce Type: new Abstract: Stein's identity is a fundamental tool in machine learning with applications in generative models, stochastic optimization, and other problems involving gradients of expectations under Gaussian distributions. Less attention has been paid to problems with non-Gaussian...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Relational In-Context Learning via Synthetic Pre-training with Structural Prior

arXiv:2603.03805v1 Announce Type: new Abstract: Relational Databases (RDBs) are the backbone of modern business, yet they lack foundation models comparable to those in text or vision. A key obstacle is that high-quality RDBs are private, scarce and structurally heterogeneous, making...

1 min 1 month, 2 weeks ago

tps

LOW Academic International

Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning

arXiv:2603.03818v1 Announce Type: new Abstract: Continual learning is a long-standing challenge in robot policy learning, where a policy must acquire new skills over time without catastrophically forgetting previously learned ones. While prior work has extensively studied continual learning in relatively...

1 min 1 month, 2 weeks ago

tps

LOW Academic International

Fairness Begins with State: Purifying Latent Preferences for Hierarchical Reinforcement Learning in Interactive Recommendation

arXiv:2603.03820v1 Announce Type: new Abstract: Interactive recommender systems (IRS) are increasingly optimized with Reinforcement Learning (RL) to capture the sequential nature of user-system dynamics. However, existing fairness-aware methods often suffer from a fundamental oversight: they assume the observed user state...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

HateMirage: An Explainable Multi-Dimensional Dataset for Decoding Faux Hate and Subtle Online Abuse

arXiv:2603.02684v1 Announce Type: new Abstract: Subtle and indirect hate speech remains an underexplored challenge in online safety research, particularly when harmful intent is embedded within misleading or manipulative narratives. Existing hate speech datasets primarily capture overt toxicity, underrepresenting the nuanced...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Graph-GRPO: Stabilizing Multi-Agent Topology Learning via Group Relative Policy Optimization

arXiv:2603.02701v1 Announce Type: new Abstract: Optimizing communication topology is fundamental to the efficiency and effectiveness of Large Language Model (LLM)-based Multi-Agent Systems (MAS). While recent approaches utilize reinforcement learning to dynamically construct task-specific graphs, they typically rely on single-sample policy...

1 min 1 month, 2 weeks ago

ead

LOW Academic United States

From Solver to Tutor: Evaluating the Pedagogical Intelligence of LLMs with KMP-Bench

arXiv:2603.02775v1 Announce Type: new Abstract: Large Language Models (LLMs) show significant potential in AI mathematical tutoring, yet current evaluations often rely on simplistic metrics or narrow pedagogical scenarios, failing to assess comprehensive, multi-turn teaching effectiveness. In this paper, we introduce...

1 min 1 month, 2 weeks ago

ead

LOW Academic United States

Nodes Are Early, Edges Are Late: Probing Diagram Representations in Large Vision-Language Models

arXiv:2603.02865v1 Announce Type: new Abstract: Large vision-language models (LVLMs) demonstrate strong performance on diagram understanding benchmarks, yet they still struggle with understanding relationships between elements, particularly those represented by nodes and directed edges (e.g., arrows and lines). To investigate the...

1 min 1 month, 2 weeks ago

ead

LOW Academic European Union

ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation

arXiv:2603.02945v1 Announce Type: new Abstract: Model merging aims to combine multiple task-specific expert models into a single model while preserving generalization across diverse tasks. However, interference among experts, especially when they are trained on different objectives, often leads to significant...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

PrivMedChat: End-to-End Differentially Private RLHF for Medical Dialogue Systems

arXiv:2603.03054v1 Announce Type: new Abstract: Large language models are increasingly used for patient-facing medical assistance and clinical decision support, but adapting them to clinical dialogue often requires supervision derived from doctor-patient conversations that may contain sensitive information. Conventional supervised fine-tuning...

1 min 1 month, 2 weeks ago

tps

LOW Academic International

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

arXiv:2603.03202v1 Announce Type: new Abstract: As large language models (LLMs) advance their mathematical capabilities toward the IMO level, the scarcity of challenging, high-quality problems for training and evaluation has become a significant bottleneck. Simultaneously, recent code agents have demonstrated sophisticated...

1 min 1 month, 2 weeks ago

tps

LOW Academic International

A Directed Graph Model and Experimental Framework for Design and Study of Time-Dependent Text Visualisation

arXiv:2603.02422v1 Announce Type: cross Abstract: Exponential growth in the quantity of digital news, social media, and other textual sources makes it difficult for humans to keep up with rapidly evolving narratives about world events. Various visualisation techniques have been touted...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

MUSE: A Run-Centric Platform for Multimodal Unified Safety Evaluation of Large Language Models

arXiv:2603.02482v1 Announce Type: cross Abstract: Safety evaluation and red-teaming of large language models remain predominantly text-centric, and existing frameworks lack the infrastructure to systematically test whether alignment generalizes to audio, image, and video inputs. We present MUSE (Multimodal Unified Safety...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs

arXiv:2603.02556v1 Announce Type: cross Abstract: Reasoning has emerged as a key capability of large language models. In linguistic tasks, this capability can be enhanced by self-improving techniques that refine reasoning paths for subsequent finetuning. However, extending these language-based self-improving approaches...

1 min 1 month, 2 weeks ago

tps

LOW Academic International

FlashEvaluator: Expanding Search Space with Parallel Evaluation

arXiv:2603.02565v1 Announce Type: cross Abstract: The Generator-Evaluator (G-E) framework, i.e., evaluating K sequences from a generator and selecting the top-ranked one according to evaluator scores, is a foundational paradigm in tasks such as Recommender Systems (RecSys) and Natural Language Processing...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

RxnNano:Training Compact LLMs for Chemical Reaction and Retrosynthesis Prediction via Hierarchical Curriculum Learning

arXiv:2603.02215v1 Announce Type: new Abstract: Chemical reaction prediction is pivotal for accelerating drug discovery and synthesis planning. Despite advances in data-driven models, current approaches are hindered by an overemphasis on parameter and dataset scaling. Some methods coupled with evaluation techniques...

1 min 1 month, 2 weeks ago

tps

LOW Academic International

NExT-Guard: Training-Free Streaming Safeguard without Token-Level Labels

arXiv:2603.02219v1 Announce Type: new Abstract: Large language models are increasingly deployed in streaming scenarios, rendering conventional post-hoc safeguards ineffective as they fail to interdict unsafe content in real-time. While streaming safeguards based on token-level supervised training could address this, they...

1 min 1 month, 2 weeks ago

ead

LOW Academic United States

MedCalc-Bench Doesn't Measure What You Think: A Benchmark Audit and the Case for Open-Book Evaluation

arXiv:2603.02222v1 Announce Type: new Abstract: MedCalc-Bench is a widely used benchmark for evaluating LLM performance on clinical calculator tasks, with state-of-the-art direct prompting scores plateauing around 35% on the Verified split (HELM MedHELM leaderboard) and the best published approach-RL with...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Subspace Geometry Governs Catastrophic Forgetting in Low-Rank Adaptation

arXiv:2603.02224v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) has emerged as a parameter-efficient approach for adapting large pre-trained models, yet its behavior under continual learning remains poorly understood. We present a geometric theory characterizing catastrophic forgetting in LoRA through the...

1 min 1 month, 2 weeks ago

ead

LOW Academic International

Scaling Reward Modeling without Human Supervision

arXiv:2603.02225v1 Announce Type: new Abstract: Learning from feedback is an instrumental process for advancing the capabilities and safety of frontier models, yet its effectiveness is often constrained by cost and scalability. We present a pilot study that explores scaling reward...

1 min 1 month, 2 weeks ago

ead

LOW Academic European Union

Neural Paging: Learning Context Management Policies for Turing-Complete Agents

arXiv:2603.02228v1 Announce Type: new Abstract: The proof that Large Language Models (LLMs) augmented with external read-write memory constitute a computationally universal system has established the theoretical foundation for general-purpose agents. However, existing implementations face a critical bottleneck: the finite and...

1 min 1 month, 2 weeks ago

ead

LOW Academic European Union

Physics-Informed Neural Networks with Architectural Physics Embedding for Large-Scale Wave Field Reconstruction

arXiv:2603.02231v1 Announce Type: new Abstract: Large-scale wave field reconstruction requires precise solutions but faces challenges with computational efficiency and accuracy. The physics-based numerical methods like Finite Element Method (FEM) provide high accuracy but struggle with large-scale or high-frequency problems due...

1 min 1 month, 2 weeks ago

ead

LOW Academic European Union

Talking with Verifiers: Automatic Specification Generation for Neural Network Verification

arXiv:2603.02235v1 Announce Type: new Abstract: Neural network verification tools currently support only a narrow class of specifications, typically expressed as low-level constraints over raw inputs and outputs. This limitation significantly hinders their adoption and practical applicability across diverse application domains...

1 min 1 month, 2 weeks ago

ead

Trade-offs in Ensembling, Merging and Routing Among Parameter-Efficient Experts

Online Learnability of Chain-of-Thought Verifiers: Soundness and Completeness Trade-offs

Hybrid Belief Reinforcement Learning for Efficient Coordinated Spatial Exploration

NuMuon: Nuclear-Norm-Constrained Muon for Compressible LLM Training

Why Are Linear RNNs More Parallelizable?

Freezing of Gait Prediction using Proactive Agent that Learns from Selected Experience and DDQN Algorithm

Graph Negative Feedback Bias Correction Framework for Adaptive Heterophily Modeling

A Stein Identity for q-Gaussians with Bounded Support

Relational In-Context Learning via Synthetic Pre-training with Structural Prior

Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning

Fairness Begins with State: Purifying Latent Preferences for Hierarchical Reinforcement Learning in Interactive Recommendation

HateMirage: An Explainable Multi-Dimensional Dataset for Decoding Faux Hate and Subtle Online Abuse

Graph-GRPO: Stabilizing Multi-Agent Topology Learning via Group Relative Policy Optimization

From Solver to Tutor: Evaluating the Pedagogical Intelligence of LLMs with KMP-Bench

Nodes Are Early, Edges Are Late: Probing Diagram Representations in Large Vision-Language Models

ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation

PrivMedChat: End-to-End Differentially Private RLHF for Medical Dialogue Systems

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

A Directed Graph Model and Experimental Framework for Design and Study of Time-Dependent Text Visualisation

MUSE: A Run-Centric Platform for Multimodal Unified Safety Evaluation of Large Language Models

Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs

FlashEvaluator: Expanding Search Space with Parallel Evaluation

RxnNano:Training Compact LLMs for Chemical Reaction and Retrosynthesis Prediction via Hierarchical Curriculum Learning

NExT-Guard: Training-Free Streaming Safeguard without Token-Level Labels

MedCalc-Bench Doesn't Measure What You Think: A Benchmark Audit and the Case for Open-Book Evaluation

Subspace Geometry Governs Catastrophic Forgetting in Low-Rank Adaptation

Scaling Reward Modeling without Human Supervision

Neural Paging: Learning Context Management Policies for Turing-Complete Agents

Physics-Informed Neural Networks with Architectural Physics Embedding for Large-Scale Wave Field Reconstruction

Talking with Verifiers: Automatic Specification Generation for Neural Network Verification

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.