International Law

LOW Academic United States

Directional Neural Collapse Explains Few-Shot Transfer in Self-Supervised Learning

arXiv:2603.03530v1 Announce Type: new Abstract: Frozen self-supervised representations often transfer well with only a few labels across many semantic tasks. We argue that a single geometric quantity, \emph{directional} CDNV (decision-axis variance), sits at the core of two favorable behaviors: strong...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Role-Aware Conditional Inference for Spatiotemporal Ecosystem Carbon Flux Prediction

arXiv:2603.03531v1 Announce Type: new Abstract: Accurate prediction of terrestrial ecosystem carbon fluxes (e.g., CO$_2$, GPP, and CH$_4$) is essential for understanding the global carbon cycle and managing its impacts. However, prediction remains challenging due to strong spatiotemporal heterogeneity: ecosystem flux...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Trade-offs in Ensembling, Merging and Routing Among Parameter-Efficient Experts

arXiv:2603.03535v1 Announce Type: new Abstract: While large language models (LLMs) fine-tuned with lightweight adapters achieve strong performance across diverse tasks, their performance on individual tasks depends on the fine-tuning strategy. Fusing independently trained models with different strengths has shown promise...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Online Learnability of Chain-of-Thought Verifiers: Soundness and Completeness Trade-offs

arXiv:2603.03538v1 Announce Type: new Abstract: Large language models with chain-of-thought generation have demonstrated great potential for producing complex mathematical proofs. However, their reasoning can often go astray, leading to increasing interest in formal and learned verifiers. A major challenge in...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Hybrid Belief Reinforcement Learning for Efficient Coordinated Spatial Exploration

arXiv:2603.03595v1 Announce Type: new Abstract: Coordinating multiple autonomous agents to explore and serve spatially heterogeneous demand requires jointly learning unknown spatial patterns and planning trajectories that maximize task performance. Pure model-based approaches provide structured uncertainty estimates but lack adaptive policy...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

NuMuon: Nuclear-Norm-Constrained Muon for Compressible LLM Training

arXiv:2603.03597v1 Announce Type: new Abstract: The rapid progress of large language models (LLMs) is increasingly constrained by memory and deployment costs, motivating compression methods for practical deployment. Many state-of-the-art compression pipelines leverage the low-rank structure of trained weight matrices, a...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Riemannian Optimization in Modular Systems

arXiv:2603.03610v1 Announce Type: new Abstract: Understanding how systems built out of modular components can be jointly optimized is an important problem in biology, engineering, and machine learning. The backpropagation algorithm is one such solution and has been instrumental in the...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Why Are Linear RNNs More Parallelizable?

arXiv:2603.03612v1 Announce Type: new Abstract: The community is increasingly exploring linear RNNs (LRNNs) as language models, motivated by their expressive power and parallelizability. While prior work establishes the expressivity benefits of LRNNs over transformers, it is unclear what makes LRNNs...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

Adaptive Sensing of Continuous Physical Systems for Machine Learning

arXiv:2603.03650v1 Announce Type: new Abstract: Physical dynamical systems can be viewed as natural information processors: their systems preserve, transform, and disperse input information. This perspective motivates learning not only from data generated by such systems, but also how to measure...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Freezing of Gait Prediction using Proactive Agent that Learns from Selected Experience and DDQN Algorithm

arXiv:2603.03651v1 Announce Type: new Abstract: Freezing of Gait (FOG) is a debilitating motor symptom commonly experienced by individuals with Parkinson's Disease (PD) which often leads to falls and reduced mobility. Timely and accurate prediction of FOG episodes is essential for...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

A Stein Identity for q-Gaussians with Bounded Support

arXiv:2603.03673v1 Announce Type: new Abstract: Stein's identity is a fundamental tool in machine learning with applications in generative models, stochastic optimization, and other problems involving gradients of expectations under Gaussian distributions. Less attention has been paid to problems with non-Gaussian...

1 min 1 month, 2 weeks ago

ear

LOW Academic United States

Why Do Unlearnable Examples Work: A Novel Perspective of Mutual Information

arXiv:2603.03725v1 Announce Type: new Abstract: The volume of freely scraped data on the Internet has driven the tremendous success of deep learning. Along with this comes the growing concern about data privacy and security. Numerous methods for generating unlearnable examples...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier

arXiv:2603.03756v1 Announce Type: new Abstract: While large language models (LLMs) show promise in scientific discovery, existing research focuses on inference or feedback-driven training, leaving the direct modeling of the generative reasoning process, $P(\text{hypothesis}|\text{background})$ ($P(h|b)$), unexplored. We demonstrate that directly training...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

LEA: Label Enumeration Attack in Vertical Federated Learning

arXiv:2603.03777v1 Announce Type: new Abstract: A typical Vertical Federated Learning (VFL) scenario involves several participants collaboratively training a machine learning model, where each party has different features for the same samples, with labels held exclusively by one party. Since labels...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Inverse Contextual Bandits without Rewards: Learning from a Non-Stationary Learner via Suffix Imitation

arXiv:2603.03778v1 Announce Type: new Abstract: We study the Inverse Contextual Bandit (ICB) problem, in which a learner seeks to optimize a policy while an observer, who cannot access the learner's rewards and only observes actions, aims to recover the underlying...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Relational In-Context Learning via Synthetic Pre-training with Structural Prior

arXiv:2603.03805v1 Announce Type: new Abstract: Relational Databases (RDBs) are the backbone of modern business, yet they lack foundation models comparable to those in text or vision. A key obstacle is that high-quality RDBs are private, scarce and structurally heterogeneous, making...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning

arXiv:2603.03818v1 Announce Type: new Abstract: Continual learning is a long-standing challenge in robot policy learning, where a policy must acquire new skills over time without catastrophically forgetting previously learned ones. While prior work has extensively studied continual learning in relatively...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Fairness Begins with State: Purifying Latent Preferences for Hierarchical Reinforcement Learning in Interactive Recommendation

arXiv:2603.03820v1 Announce Type: new Abstract: Interactive recommender systems (IRS) are increasingly optimized with Reinforcement Learning (RL) to capture the sequential nature of user-system dynamics. However, existing fairness-aware methods often suffer from a fundamental oversight: they assume the observed user state...

1 min 1 month, 2 weeks ago

ear

LOW Academic European Union

Large-Margin Hyperdimensional Computing: A Learning-Theoretical Perspective

arXiv:2603.03830v1 Announce Type: new Abstract: Overparameterized machine learning (ML) methods such as neural networks may be prohibitively resource intensive for devices with limited computational capabilities. Hyperdimensional computing (HDC) is an emerging resource efficient and low-complexity ML method that allows hardware...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

Structure-Aware Distributed Backdoor Attacks in Federated Learning

arXiv:2603.03865v1 Announce Type: new Abstract: While federated learning protects data privacy, it also makes the model update process vulnerable to long-term stealthy perturbations. Existing studies on backdoor attacks in federated learning mainly focus on trigger design or poisoning strategies, typically...

1 min 1 month, 2 weeks ago

ear

LOW Academic International

HateMirage: An Explainable Multi-Dimensional Dataset for Decoding Faux Hate and Subtle Online Abuse

arXiv:2603.02684v1 Announce Type: new Abstract: Subtle and indirect hate speech remains an underexplored challenge in online safety research, particularly when harmful intent is embedded within misleading or manipulative narratives. Existing hate speech datasets primarily capture overt toxicity, underrepresenting the nuanced...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Graph-GRPO: Stabilizing Multi-Agent Topology Learning via Group Relative Policy Optimization

arXiv:2603.02701v1 Announce Type: new Abstract: Optimizing communication topology is fundamental to the efficiency and effectiveness of Large Language Model (LLM)-based Multi-Agent Systems (MAS). While recent approaches utilize reinforcement learning to dynamically construct task-specific graphs, they typically rely on single-sample policy...

1 min 1 month, 3 weeks ago

ear

LOW Academic European Union

Sensory-Aware Sequential Recommendation via Review-Distilled Representations

arXiv:2603.02709v1 Announce Type: new Abstract: We propose a novel framework for sensory-aware sequential recommendation that enriches item representations with linguistically extracted sensory attributes from product reviews. Our approach, \textsc{ASEGR} (Attribute-based Sensory Enhanced Generative Recommendation), introduces a two-stage pipeline in which...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

OCR or Not? Rethinking Document Information Extraction in the MLLMs Era with Real-World Large-Scale Datasets

arXiv:2603.02789v1 Announce Type: new Abstract: Multimodal Large Language Models (MLLMs) enhance the potential of natural language processing. However, their actual impact on document information extraction remains unclear. In particular, it is unclear whether an MLLM-only pipeline--while simpler--can truly match the...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Faster, Cheaper, More Accurate: Specialised Knowledge Tracing Models Outperform LLMs

arXiv:2603.02830v1 Announce Type: new Abstract: Predicting future student responses to questions is particularly valuable for educational learning platforms where it enables effective interventions. One of the key approaches to do this has been through the use of knowledge tracing (KT)...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

A Browser-based Open Source Assistant for Multimodal Content Verification

arXiv:2603.02842v1 Announce Type: new Abstract: Disinformation and false content produced by generative AI pose a significant challenge for journalists and fact-checkers who must rapidly verify digital media information. While there is an abundance of NLP models for detecting credibility signals...

1 min 1 month, 3 weeks ago

ear

LOW Academic United States

Nodes Are Early, Edges Are Late: Probing Diagram Representations in Large Vision-Language Models

arXiv:2603.02865v1 Announce Type: new Abstract: Large vision-language models (LVLMs) demonstrate strong performance on diagram understanding benchmarks, yet they still struggle with understanding relationships between elements, particularly those represented by nodes and directed edges (e.g., arrows and lines). To investigate the...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

Learning to Generate and Extract: A Multi-Agent Collaboration Framework For Zero-shot Document-level Event Arguments Extraction

arXiv:2603.02909v1 Announce Type: new Abstract: Document-level event argument extraction (DEAE) is essential for knowledge acquisition, aiming to extract participants of events from documents.In the zero-shot setting, existing methods employ LLMs to generate synthetic data to address the challenge posed by...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

MaBERT:A Padding Safe Interleaved Transformer Mamba Hybrid Encoder for Efficient Extended Context Masked Language Modeling

arXiv:2603.03001v1 Announce Type: new Abstract: Self attention encoders such as Bidirectional Encoder Representations from Transformers(BERT) scale quadratically with sequence length, making long context modeling expensive. Linear time state space models, such as Mamba, are efficient; however, they show limitations in...

1 min 1 month, 3 weeks ago

ear

LOW Academic International

PrivMedChat: End-to-End Differentially Private RLHF for Medical Dialogue Systems

arXiv:2603.03054v1 Announce Type: new Abstract: Large language models are increasingly used for patient-facing medical assistance and clinical decision support, but adapting them to clinical dialogue often requires supervision derived from doctor-patient conversations that may contain sensitive information. Conventional supervised fine-tuning...

1 min 1 month, 3 weeks ago

ear

Directional Neural Collapse Explains Few-Shot Transfer in Self-Supervised Learning

Role-Aware Conditional Inference for Spatiotemporal Ecosystem Carbon Flux Prediction

Trade-offs in Ensembling, Merging and Routing Among Parameter-Efficient Experts

Online Learnability of Chain-of-Thought Verifiers: Soundness and Completeness Trade-offs

Hybrid Belief Reinforcement Learning for Efficient Coordinated Spatial Exploration

NuMuon: Nuclear-Norm-Constrained Muon for Compressible LLM Training

Riemannian Optimization in Modular Systems

Why Are Linear RNNs More Parallelizable?

Adaptive Sensing of Continuous Physical Systems for Machine Learning

Freezing of Gait Prediction using Proactive Agent that Learns from Selected Experience and DDQN Algorithm

A Stein Identity for q-Gaussians with Bounded Support

Why Do Unlearnable Examples Work: A Novel Perspective of Mutual Information

MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier

LEA: Label Enumeration Attack in Vertical Federated Learning

Inverse Contextual Bandits without Rewards: Learning from a Non-Stationary Learner via Suffix Imitation

Relational In-Context Learning via Synthetic Pre-training with Structural Prior

Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning

Fairness Begins with State: Purifying Latent Preferences for Hierarchical Reinforcement Learning in Interactive Recommendation

Large-Margin Hyperdimensional Computing: A Learning-Theoretical Perspective

Structure-Aware Distributed Backdoor Attacks in Federated Learning

HateMirage: An Explainable Multi-Dimensional Dataset for Decoding Faux Hate and Subtle Online Abuse

Graph-GRPO: Stabilizing Multi-Agent Topology Learning via Group Relative Policy Optimization

Sensory-Aware Sequential Recommendation via Review-Distilled Representations

OCR or Not? Rethinking Document Information Extraction in the MLLMs Era with Real-World Large-Scale Datasets

Faster, Cheaper, More Accurate: Specialised Knowledge Tracing Models Outperform LLMs

A Browser-based Open Source Assistant for Multimodal Content Verification

Nodes Are Early, Edges Are Late: Probing Diagram Representations in Large Vision-Language Models

Learning to Generate and Extract: A Multi-Agent Collaboration Framework For Zero-shot Document-level Event Arguments Extraction

MaBERT:A Padding Safe Interleaved Transformer Mamba Hybrid Encoder for Efficient Extended Context Masked Language Modeling

PrivMedChat: End-to-End Differentially Private RLHF for Medical Dialogue Systems

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.