NuMuon: Nuclear-Norm-Constrained Muon for Compressible LLM Training
arXiv:2603.03597v1 Announce Type: new Abstract: The rapid progress of large language models (LLMs) is increasingly constrained by memory and deployment costs, motivating compression methods for practical deployment. Many state-of-the-art compression pipelines leverage the low-rank structure of trained weight matrices, a...
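As generic background for the low-rank framing in this abstract (not the paper's algorithm): the nuclear norm is the sum of a matrix's singular values and is the standard convex surrogate for rank. A minimal numpy sketch:

```python
import numpy as np

# The nuclear norm ||W||_* is the sum of the singular values of W.
def nuclear_norm(W: np.ndarray) -> float:
    return float(np.linalg.svd(W, compute_uv=False).sum())

# A rank-1 matrix W = u v^T has a single nonzero singular value,
# equal to ||u|| * ||v||, so that product is its nuclear norm.
u = np.array([[1.0], [2.0]])
v = np.array([[3.0, 4.0]])
W = u @ v                      # 2x2 rank-1 matrix
print(nuclear_norm(W))         # equals sqrt(5) * 5
```

Constraining or penalizing this quantity during training is the usual way to bias weight matrices toward low rank.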
Adaptive Sensing of Continuous Physical Systems for Machine Learning
arXiv:2603.03650v1 Announce Type: new Abstract: Physical dynamical systems can be viewed as natural information processors: they preserve, transform, and disperse input information. This perspective motivates learning not only from data generated by such systems, but also how to measure...
Graph Negative Feedback Bias Correction Framework for Adaptive Heterophily Modeling
arXiv:2603.03662v1 Announce Type: new Abstract: Graph Neural Networks (GNNs) have emerged as a powerful framework for processing graph-structured data. However, conventional GNNs and their variants are inherently limited by the homophily assumption, leading to degradation in performance on heterophilic graphs....
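For context on the homophily assumption this abstract refers to (generic background, not the paper's method): the edge homophily ratio is the fraction of edges whose endpoints share a label — near 1 for homophilic graphs, near 0 for heterophilic ones. A toy sketch:

```python
# Edge homophily: fraction of edges connecting same-label nodes.
def edge_homophily(edges, labels) -> float:
    same = sum(labels[u] == labels[v] for u, v in edges)
    return same / len(edges)

# Toy graph: 4 nodes with labels [0, 0, 1, 1] and three edges.
labels = [0, 0, 1, 1]
edges = [(0, 1), (1, 2), (2, 3)]
print(edge_homophily(edges, labels))  # 2 of 3 edges are intra-class
```

Conventional message-passing GNNs implicitly assume this ratio is high; heterophily-aware methods aim to stay accurate when it is not.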
LEA: Label Enumeration Attack in Vertical Federated Learning
arXiv:2603.03777v1 Announce Type: new Abstract: A typical Vertical Federated Learning (VFL) scenario involves several participants collaboratively training a machine learning model, where each party has different features for the same samples, with labels held exclusively by one party. Since labels...
When and Where to Reset Matters for Long-Term Test-Time Adaptation
arXiv:2603.03796v1 Announce Type: new Abstract: When continual test-time adaptation (TTA) persists over the long term, errors accumulate in the model and further cause it to predict only a few classes for all inputs, a phenomenon known as model collapse. Recent...
Relational In-Context Learning via Synthetic Pre-training with Structural Prior
arXiv:2603.03805v1 Announce Type: new Abstract: Relational Databases (RDBs) are the backbone of modern business, yet they lack foundation models comparable to those in text or vision. A key obstacle is that high-quality RDBs are private, scarce and structurally heterogeneous, making...
Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning
arXiv:2603.03818v1 Announce Type: new Abstract: Continual learning is a long-standing challenge in robot policy learning, where a policy must acquire new skills over time without catastrophically forgetting previously learned ones. While prior work has extensively studied continual learning in relatively...
Efficient Self-Evaluation for Diffusion Language Models via Sequence Regeneration
arXiv:2603.02760v1 Announce Type: new Abstract: Diffusion large language models (dLLMs) have recently attracted significant attention for their ability to enhance diversity, controllability, and parallelism. However, their non-sequential, bidirectionally masked generation makes quality assessment difficult, underscoring the need for effective self-evaluation....
Learning to Generate and Extract: A Multi-Agent Collaboration Framework For Zero-shot Document-level Event Arguments Extraction
arXiv:2603.02909v1 Announce Type: new Abstract: Document-level event argument extraction (DEAE) is essential for knowledge acquisition, aiming to extract participants of events from documents. In the zero-shot setting, existing methods employ LLMs to generate synthetic data to address the challenge posed by...
ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation
arXiv:2603.02945v1 Announce Type: new Abstract: Model merging aims to combine multiple task-specific expert models into a single model while preserving generalization across diverse tasks. However, interference among experts, especially when they are trained on different objectives, often leads to significant...
PrivMedChat: End-to-End Differentially Private RLHF for Medical Dialogue Systems
arXiv:2603.03054v1 Announce Type: new Abstract: Large language models are increasingly used for patient-facing medical assistance and clinical decision support, but adapting them to clinical dialogue often requires supervision derived from doctor-patient conversations that may contain sensitive information. Conventional supervised fine-tuning...
Routing Absorption in Sparse Attention: Why Random Gates Are Hard to Beat
arXiv:2603.02227v1 Announce Type: cross Abstract: Can a transformer learn which attention entries matter during training? In principle, yes: attention distributions are highly concentrated, and a small gate network can identify the important entries post-hoc with near-perfect accuracy. In practice, barely....
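The abstract's claim that attention distributions are highly concentrated can be checked directly: measure how much softmax probability mass the top-k entries of a score row capture. A minimal sketch (illustrative, not this paper's gate network):

```python
import numpy as np

def topk_mass(scores: np.ndarray, k: int) -> float:
    # Softmax over raw attention scores, then sum the k largest probabilities.
    p = np.exp(scores - scores.max())
    p /= p.sum()
    return float(np.sort(p)[-k:].sum())

# A peaked score vector: nearly all mass lands on the top-2 entries.
scores = np.array([8.0, 7.5, 0.1, 0.0, -1.0])
print(topk_mass(scores, k=2))
```

When this mass is near 1.0 for small k, a sparse gate that keeps only those entries loses almost nothing in principle — which is exactly the gap between "in principle" and "in practice" the abstract probes.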
A Directed Graph Model and Experimental Framework for Design and Study of Time-Dependent Text Visualisation
arXiv:2603.02422v1 Announce Type: cross Abstract: Exponential growth in the quantity of digital news, social media, and other textual sources makes it difficult for humans to keep up with rapidly evolving narratives about world events. Various visualisation techniques have been touted...
ATPO: Adaptive Tree Policy Optimization for Multi-Turn Medical Dialogue
arXiv:2603.02216v1 Announce Type: new Abstract: Effective information seeking in multi-turn medical dialogues is critical for accurate diagnosis, especially when dealing with incomplete information. Aligning Large Language Models (LLMs) for these interactive scenarios is challenging due to the uncertainty inherent in...
Is Retraining-Free Enough? The Necessity of Router Calibration for Efficient MoE Compression
arXiv:2603.02217v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) models scale capacity efficiently, but their massive parameter footprint creates a deployment-time memory bottleneck. We organize retraining-free MoE compression into three paradigms - Expert Pruning, Expert Editing, and Expert Merging - and show...
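As background for the router calibration question (standard top-k softmax routing, not this paper's calibration procedure): each token selects the k highest-scoring experts and renormalizes their gate weights. A minimal sketch:

```python
import numpy as np

def topk_route(logits: np.ndarray, k: int):
    idx = np.argsort(logits)[-k:]            # indices of the top-k experts
    w = np.exp(logits[idx] - logits[idx].max())
    return idx, w / w.sum()                  # renormalized gate weights

logits = np.array([0.2, 1.5, -0.3, 2.1])    # router scores over 4 experts
experts, weights = topk_route(logits, k=2)
print(experts, weights)                      # experts 1 and 3 selected
```

Compressing experts shifts the distribution the router was trained against, which is why post-compression routing decisions can degrade without recalibration.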
Forecasting as Rendering: A 2D Gaussian Splatting Framework for Time Series Forecasting
arXiv:2603.02220v1 Announce Type: new Abstract: Time series forecasting (TSF) remains a challenging problem due to the intricate entanglement of intra-period fluctuations and inter-period trends. While recent advances have attempted to reshape 1D sequences into 2D period-phase representations, they suffer from two principal...
Subspace Geometry Governs Catastrophic Forgetting in Low-Rank Adaptation
arXiv:2603.02224v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) has emerged as a parameter-efficient approach for adapting large pre-trained models, yet its behavior under continual learning remains poorly understood. We present a geometric theory characterizing catastrophic forgetting in LoRA through the...
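For readers unfamiliar with the setup this abstract builds on, the standard LoRA parameterization (public background, not the paper's geometric theory) freezes the pretrained weight W and trains a low-rank update (alpha / r) * B @ A:

```python
import numpy as np

rng = np.random.default_rng(0)
d_out, d_in, r, alpha = 64, 64, 4, 8
W = rng.standard_normal((d_out, d_in))       # frozen pretrained weight
A = rng.standard_normal((r, d_in)) * 0.01    # small random init
B = np.zeros((d_out, r))                     # B starts at zero, so delta W = 0
W_adapted = W + (alpha / r) * B @ A

# Trainable parameters: r * (d_in + d_out) versus d_in * d_out for full tuning.
print(r * (d_in + d_out), d_in * d_out)      # 512 vs 4096
```

The update lives in the rank-r subspace spanned by B and A, which is why the geometry of that subspace governs how sequential adaptations interfere.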
Neural Paging: Learning Context Management Policies for Turing-Complete Agents
arXiv:2603.02228v1 Announce Type: new Abstract: The proof that Large Language Models (LLMs) augmented with external read-write memory constitute a computationally universal system has established the theoretical foundation for general-purpose agents. However, existing implementations face a critical bottleneck: the finite and...
Generalized Discrete Diffusion with Self-Correction
arXiv:2603.02230v1 Announce Type: new Abstract: Self-correction is an effective technique for maintaining parallel sampling in discrete diffusion models with minimal performance degradation. Prior work has explored self-correction at inference time or during post-training; however, such approaches often suffer from limited...
PRISM: Exploring Heterogeneous Pretrained EEG Foundation Model Transfer to Clinical Differential Diagnosis
arXiv:2603.02268v1 Announce Type: new Abstract: EEG foundation models are typically pretrained on narrow-source clinical archives and evaluated on benchmarks from the same ecosystem, leaving unclear whether representations encode neural physiology or recording-distribution artifacts. We introduce PRISM (Population Representative Invariant Signal...
Quantum-Inspired Fine-Tuning for Few-Shot AIGC Detection via Phase-Structured Reparameterization
arXiv:2603.02281v1 Announce Type: new Abstract: Recent studies show that quantum neural networks (QNNs) generalize well in few-shot regimes. To extend this advantage to large-scale tasks, we propose Q-LoRA, a quantum-enhanced fine-tuning scheme that integrates lightweight QNNs into the low-rank adaptation...
Using the SEKF to Transfer NN Models of Dynamical Systems with Limited Data
arXiv:2603.02439v1 Announce Type: new Abstract: Data-driven models of dynamical systems require extensive amounts of training data. For many practical applications, gathering sufficient data is not feasible due to cost or safety concerns. This work uses the Subset Extended Kalman Filter...
The SCOTUS attorney switcheroo
Empirical SCOTUS is a recurring series by Adam Feldman that looks at Supreme Court data, primarily in the form of opinions and oral arguments, to provide insights into the justices’ decision making and […] The post “The SCOTUS attorney switcheroo” appeared first on SCOTUSblog.
CIRCUS: Circuit Consensus under Uncertainty via Stability Ensembles
arXiv:2603.00523v1 Announce Type: new Abstract: Mechanistic circuit discovery is notoriously sensitive to arbitrary analyst choices, especially pruning thresholds and feature dictionaries, often yielding brittle "one-shot" explanations with no principled notion of uncertainty. We reframe circuit discovery as an uncertainty-quantification problem...
CoMoL: Efficient Mixture of LoRA Experts via Dynamic Core Space Merging
arXiv:2603.00573v1 Announce Type: new Abstract: Large language models (LLMs) achieve remarkable performance on diverse downstream and domain-specific tasks via parameter-efficient fine-tuning (PEFT). However, existing PEFT methods, particularly MoE-LoRA architectures, suffer from limited parameter efficiency and coarse-grained adaptation due to the...
QQ: A Toolkit for Language Identifiers and Metadata
arXiv:2603.00620v1 Announce Type: new Abstract: The growing number of languages considered in multilingual NLP, including new datasets and tasks, poses challenges regarding properly and accurately reporting which languages are used and how. For example, datasets often use different language identifiers;...
BLUFF: Benchmarking the Detection of False and Synthetic Content across 58 Low-Resource Languages
arXiv:2603.00634v1 Announce Type: new Abstract: Multilingual falsehoods threaten information integrity worldwide, yet detection benchmarks remain confined to English or a few high-resource languages, leaving low-resource linguistic communities without robust defense tools. We introduce BLUFF, a comprehensive benchmark for detecting false...
SSKG Hub: An Expert-Guided Platform for LLM-Empowered Sustainability Standards Knowledge Graphs
arXiv:2603.00669v1 Announce Type: new Abstract: Sustainability disclosure standards (e.g., GRI, SASB, TCFD, IFRS S2) are comprehensive yet lengthy, terminology-dense, and highly cross-referential, hindering structured analysis and downstream use. We present SSKG Hub (Sustainability Standards Knowledge Graph Hub), a research prototype...
MedGPT-oss: Training a General-Purpose Vision-Language Model for Biomedicine
arXiv:2603.00842v1 Announce Type: new Abstract: Biomedical multimodal assistants have the potential to unify radiology, pathology, and clinical-text reasoning, yet a critical deployment gap remains: top-performing systems are either closed-source or computationally prohibitive, precluding the on-premises deployment required for patient privacy...
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
arXiv:2603.01059v1 Announce Type: new Abstract: Recent advances in large language models (LLMs) have enabled increasingly capable chatbots. However, most existing systems focus on single-user settings and do not generalize well to multi-user group chats, where agents require more proactive and...