AI & Technology Law

LOW Academic European Union

RADAR: Learning to Route with Asymmetry-aware DistAnce Representations

arXiv:2603.03388v1 Announce Type: new Abstract: Recent neural solvers have achieved strong performance on vehicle routing problems (VRPs), yet they mainly assume symmetric Euclidean distances, restricting applicability to real-world scenarios. A core challenge is encoding the relational features in asymmetric distance...

1 min 1 month, 3 weeks ago

ai

LOW Academic European Union

A Short Note on a Variant of the Squint Algorithm

arXiv:2603.03409v1 Announce Type: new Abstract: This short note describes a simple variant of the Squint algorithm of Koolen and Van Erven [2015] for the classic expert problem. Via an equally simple modification of their proof, we prove that this variant...

1 min 1 month, 3 weeks ago

algorithm

LOW Academic International

Half the Nonlinearity Is Wasted: Measuring and Reallocating the Transformer's MLP Budget

arXiv:2603.03459v1 Announce Type: new Abstract: We investigate when transformer MLP nonlinearity is actually necessary. A gate with $d+1$ parameters decides when to replace the full MLP with a linear surrogate. Through systematic investigation across six models (162M-2.8B parameters), two architectures,...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning

arXiv:2603.03475v1 Announce Type: new Abstract: Mathematical reasoning models are widely deployed in education, automated tutoring, and decision support systems despite exhibiting fundamental computational instabilities. We demonstrate that state-of-the-art models (Qwen2.5-Math-7B) achieve 61% accuracy through a mixture of reliable and unreliable...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

Minimax Optimal Strategy for Delayed Observations in Online Reinforcement Learning

arXiv:2603.03480v1 Announce Type: new Abstract: We study reinforcement learning with delayed state observation, where the agent observes the current state after some random number of time steps. We propose an algorithm that combines the augmentation method and the upper confidence...

1 min 1 month, 3 weeks ago

algorithm

LOW Academic European Union

Optimal trajectory-guided stochastic co-optimization for e-fuel system design and real-time operation

arXiv:2603.03484v1 Announce Type: new Abstract: E-fuels are promising long-term energy carriers supporting the net-zero transition. However, the large combinatorial design-operation spaces under renewable uncertainty make the use of mathematical programming impractical for co-optimizing e-fuel production systems. Here, we present MasCOR,...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

Orbital Transformers for Predicting Wavefunctions in Time-Dependent Density Functional Theory

arXiv:2603.03511v1 Announce Type: new Abstract: We aim to learn wavefunctions simulated by time-dependent density functional theory (TDDFT), which can be efficiently represented as linear combination coefficients of atomic orbitals. In real-time TDDFT, the electronic wavefunctions of a molecule evolve over...

1 min 1 month, 3 weeks ago

ai

LOW Academic United States

Logit-Level Uncertainty Quantification in Vision-Language Models for Histopathology Image Analysis

arXiv:2603.03527v1 Announce Type: new Abstract: Vision-Language Models (VLMs) with their multimodal capabilities have demonstrated remarkable success in almost all domains, including education, transportation, healthcare, energy, finance, law, and retail. Nevertheless, the utilization of VLMs in healthcare applications raises crucial concerns...

1 min 1 month, 3 weeks ago

ai

LOW Academic United States

Directional Neural Collapse Explains Few-Shot Transfer in Self-Supervised Learning

arXiv:2603.03530v1 Announce Type: new Abstract: Frozen self-supervised representations often transfer well with only a few labels across many semantic tasks. We argue that a single geometric quantity, \emph{directional} CDNV (decision-axis variance), sits at the core of two favorable behaviors: strong...

1 min 1 month, 3 weeks ago

ai

LOW Academic United States

Role-Aware Conditional Inference for Spatiotemporal Ecosystem Carbon Flux Prediction

arXiv:2603.03531v1 Announce Type: new Abstract: Accurate prediction of terrestrial ecosystem carbon fluxes (e.g., CO$_2$, GPP, and CH$_4$) is essential for understanding the global carbon cycle and managing its impacts. However, prediction remains challenging due to strong spatiotemporal heterogeneity: ecosystem flux...

1 min 1 month, 3 weeks ago

ai

LOW Academic European Union

Extending Neural Operators: Robust Handling of Functions Beyond the Training Set

arXiv:2603.03621v1 Announce Type: new Abstract: We develop a rigorous framework for extending neural operators to handle out-of-distribution input functions. We leverage kernel approximation techniques and provide theory for characterizing the input-output function spaces in terms of Reproducing Kernel Hilbert Spaces...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

Harmonic Dataset Distillation for Time Series Forecasting

arXiv:2603.03760v1 Announce Type: new Abstract: Time Series forecasting (TSF) in the modern era faces significant computational and storage cost challenges due to the massive scale of real-world data. Dataset Distillation (DD), a paradigm that synthesizes a small, compact dataset to...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

When and Where to Reset Matters for Long-Term Test-Time Adaptation

arXiv:2603.03796v1 Announce Type: new Abstract: When continual test-time adaptation (TTA) persists over the long term, errors accumulate in the model and further cause it to predict only a few classes for all inputs, a phenomenon known as model collapse. Recent...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

Relational In-Context Learning via Synthetic Pre-training with Structural Prior

arXiv:2603.03805v1 Announce Type: new Abstract: Relational Databases (RDBs) are the backbone of modern business, yet they lack foundation models comparable to those in text or vision. A key obstacle is that high-quality RDBs are private, scarce and structurally heterogeneous, making...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning

arXiv:2603.03818v1 Announce Type: new Abstract: Continual learning is a long-standing challenge in robot policy learning, where a policy must acquire new skills over time without catastrophically forgetting previously learned ones. While prior work has extensively studied continual learning in relatively...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

HateMirage: An Explainable Multi-Dimensional Dataset for Decoding Faux Hate and Subtle Online Abuse

arXiv:2603.02684v1 Announce Type: new Abstract: Subtle and indirect hate speech remains an underexplored challenge in online safety research, particularly when harmful intent is embedded within misleading or manipulative narratives. Existing hate speech datasets primarily capture overt toxicity, underrepresenting the nuanced...

1 min 1 month, 3 weeks ago

ai

LOW Academic European Union

Sensory-Aware Sequential Recommendation via Review-Distilled Representations

arXiv:2603.02709v1 Announce Type: new Abstract: We propose a novel framework for sensory-aware sequential recommendation that enriches item representations with linguistically extracted sensory attributes from product reviews. Our approach, \textsc{ASEGR} (Attribute-based Sensory Enhanced Generative Recommendation), introduces a two-stage pipeline in which...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

The Distribution of Phoneme Frequencies across the World's Languages: Macroscopic and Microscopic Information-Theoretic Models

arXiv:2603.02860v1 Announce Type: new Abstract: We demonstrate that the frequency distribution of phonemes across languages can be explained at both macroscopic and microscopic levels. Macroscopically, phoneme rank-frequency distributions closely follow the order statistics of a symmetric Dirichlet distribution whose single...

1 min 1 month, 3 weeks ago

ai

LOW Academic United States

Nodes Are Early, Edges Are Late: Probing Diagram Representations in Large Vision-Language Models

arXiv:2603.02865v1 Announce Type: new Abstract: Large vision-language models (LVLMs) demonstrate strong performance on diagram understanding benchmarks, yet they still struggle with understanding relationships between elements, particularly those represented by nodes and directed edges (e.g., arrows and lines). To investigate the...

1 min 1 month, 3 weeks ago

ai

LOW Academic European Union

ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation

arXiv:2603.02945v1 Announce Type: new Abstract: Model merging aims to combine multiple task-specific expert models into a single model while preserving generalization across diverse tasks. However, interference among experts, especially when they are trained on different objectives, often leads to significant...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

MaBERT:A Padding Safe Interleaved Transformer Mamba Hybrid Encoder for Efficient Extended Context Masked Language Modeling

arXiv:2603.03001v1 Announce Type: new Abstract: Self attention encoders such as Bidirectional Encoder Representations from Transformers(BERT) scale quadratically with sequence length, making long context modeling expensive. Linear time state space models, such as Mamba, are efficient; however, they show limitations in...

1 min 1 month, 3 weeks ago

ai

LOW Academic European Union

UniSkill: A Dataset for Matching University Curricula to Professional Competencies

arXiv:2603.03134v1 Announce Type: new Abstract: Skill extraction and recommendation systems have been studied from recruiter, applicant, and education perspectives. While AI applications in job advertisements have received broad attention, deficiencies in the instructed skills side remain a challenge. In this...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

arXiv:2603.03194v1 Announce Type: new Abstract: Current benchmarks for code agents primarily assess narrow, repository-specific fixes, overlooking critical real-world challenges such as cross-repository reasoning, domain-specialized problem solving, dependency-driven migration, and full-repository generation. To address this gap, we introduce BeyondSWE, a comprehensive...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use

arXiv:2603.03205v1 Announce Type: new Abstract: Agentic language models operate in a fundamentally different safety regime than chat models: they must plan, call tools, and execute long-horizon actions where a single misstep, such as accessing files or entering credentials, can cause...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

Routing Absorption in Sparse Attention: Why Random Gates Are Hard to Beat

arXiv:2603.02227v1 Announce Type: cross Abstract: Can a transformer learn which attention entries matter during training? In principle, yes: attention distributions are highly concentrated, and a small gate network can identify the important entries post-hoc with near-perfect accuracy. In practice, barely....

1 min 1 month, 3 weeks ago

ai

LOW Academic International

Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs

arXiv:2603.02556v1 Announce Type: cross Abstract: Reasoning has emerged as a key capability of large language models. In linguistic tasks, this capability can be enhanced by self-improving techniques that refine reasoning paths for subsequent finetuning. However, extending these language-based self-improving approaches...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

FlashEvaluator: Expanding Search Space with Parallel Evaluation

arXiv:2603.02565v1 Announce Type: cross Abstract: The Generator-Evaluator (G-E) framework, i.e., evaluating K sequences from a generator and selecting the top-ranked one according to evaluator scores, is a foundational paradigm in tasks such as Recommender Systems (RecSys) and Natural Language Processing...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

Is Retraining-Free Enough? The Necessity of Router Calibration for Efficient MoE Compression

arXiv:2603.02217v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) models scale capacity efficiently, but their massive parameter footprint creates a deployment-time memory bottleneck. We organize retraining-free MoE compression into three paradigms - Expert Pruning, Expert Editing, and Expert Merging - and show...

1 min 1 month, 3 weeks ago

ai

LOW Academic United States

Forecasting as Rendering: A 2D Gaussian Splatting Framework for Time Series Forecasting

arXiv:2603.02220v1 Announce Type: new Abstract: Time series forecasting (TSF) remains a challenging problem due to the intricate entanglement of intraperiod-fluctuations and interperiod-trends. While recent advances have attempted to reshape 1D sequences into 2D period-phase representations, they suffer from two principal...

1 min 1 month, 3 weeks ago

ai

LOW Academic International

Subspace Geometry Governs Catastrophic Forgetting in Low-Rank Adaptation

arXiv:2603.02224v1 Announce Type: new Abstract: Low-Rank Adaptation (LoRA) has emerged as a parameter-efficient approach for adapting large pre-trained models, yet its behavior under continual learning remains poorly understood. We present a geometric theory characterizing catastrophic forgetting in LoRA through the...

1 min 1 month, 3 weeks ago

ai

RADAR: Learning to Route with Asymmetry-aware DistAnce Representations

A Short Note on a Variant of the Squint Algorithm

Half the Nonlinearity Is Wasted: Measuring and Reallocating the Transformer's MLP Budget

When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning

Minimax Optimal Strategy for Delayed Observations in Online Reinforcement Learning

Optimal trajectory-guided stochastic co-optimization for e-fuel system design and real-time operation

Orbital Transformers for Predicting Wavefunctions in Time-Dependent Density Functional Theory

Logit-Level Uncertainty Quantification in Vision-Language Models for Histopathology Image Analysis

Directional Neural Collapse Explains Few-Shot Transfer in Self-Supervised Learning

Role-Aware Conditional Inference for Spatiotemporal Ecosystem Carbon Flux Prediction

Extending Neural Operators: Robust Handling of Functions Beyond the Training Set

Harmonic Dataset Distillation for Time Series Forecasting

When and Where to Reset Matters for Long-Term Test-Time Adaptation

Relational In-Context Learning via Synthetic Pre-training with Structural Prior

Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning

HateMirage: An Explainable Multi-Dimensional Dataset for Decoding Faux Hate and Subtle Online Abuse

Sensory-Aware Sequential Recommendation via Review-Distilled Representations

The Distribution of Phoneme Frequencies across the World's Languages: Macroscopic and Microscopic Information-Theoretic Models

Nodes Are Early, Edges Are Late: Probing Diagram Representations in Large Vision-Language Models

ACE-Merging: Data-Free Model Merging with Adaptive Covariance Estimation

MaBERT:A Padding Safe Interleaved Transformer Mamba Hybrid Encoder for Efficient Extended Context Masked Language Modeling

UniSkill: A Dataset for Matching University Curricula to Professional Competencies

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use

Routing Absorption in Sparse Attention: Why Random Gates Are Hard to Beat

Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs

FlashEvaluator: Expanding Search Space with Parallel Evaluation

Is Retraining-Free Enough? The Necessity of Router Calibration for Efficient MoE Compression

Forecasting as Rendering: A 2D Gaussian Splatting Framework for Time Series Forecasting

Subspace Geometry Governs Catastrophic Forgetting in Low-Rank Adaptation

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.