International Law

LOW Academic European Union

AscendOptimizer: Episodic Agent for Ascend NPU Operator Optimization

arXiv:2603.23566v1 Announce Type: new Abstract: AscendC (Ascend C) operator optimization on Huawei Ascend neural processing units (NPUs) faces a two-fold knowledge bottleneck: unlike the CUDA ecosystem, there are few public reference implementations to learn from, and performance hinges on a...

1 min 3 weeks, 5 days ago

ear

LOW Academic United States

StateLinFormer: Stateful Training Enhancing Long-term Memory in Navigation

arXiv:2603.23571v1 Announce Type: new Abstract: Effective navigation intelligence relies on long-term memory to support both immediate generalization and sustained adaptation. However, existing approaches face a dilemma: modular systems rely on explicit mapping but lack flexibility, while Transformer-based end-to-end models are...

1 min 3 weeks, 5 days ago

ear

LOW Academic European Union

Dual-Criterion Curriculum Learning: Application to Temporal Data

arXiv:2603.23573v1 Announce Type: new Abstract: Curriculum Learning (CL) is a meta-learning paradigm that trains a model by feeding the data instances incrementally according to a schedule, which is based on difficulty progression. Defining meaningful difficulty assessment measures is crucial and...

1 min 3 weeks, 5 days ago

ear

LOW Academic International

PoiCGAN: A Targeted Poisoning Based on Feature-Label Joint Perturbation in Federated Learning

arXiv:2603.23574v1 Announce Type: new Abstract: Federated Learning (FL), as a popular distributed learning paradigm, has shown outstanding performance in improving computational efficiency and protecting data privacy, and is widely applied in industrial image classification. However, due to its distributed nature,...

1 min 3 weeks, 5 days ago

ear

LOW Academic International

The Geometric Price of Discrete Logic: Context-driven Manifold Dynamics of Number Representations

arXiv:2603.23577v1 Announce Type: new Abstract: Large language models (LLMs) generalize smoothly across continuous semantic spaces, yet strict logical reasoning demands the formation of discrete decision boundaries. Prevailing theories relying on linear isometric projections fail to resolve this fundamental tension. In...

1 min 3 weeks, 5 days ago

ear

LOW Academic European Union

Residual Attention Physics-Informed Neural Networks for Robust Multiphysics Simulation of Steady-State Electrothermal Energy Systems

arXiv:2603.23578v1 Announce Type: new Abstract: Efficient thermal management and precise field prediction are critical for the design of advanced energy systems, including electrohydrodynamic transport, microfluidic energy harvesters, and electrically driven thermal regulators. However, the steady-state simulation of these electrothermal coupled...

1 min 3 weeks, 5 days ago

ear

LOW Academic International

MetaKube: An Experience-Aware LLM Framework for Kubernetes Failure Diagnosis

arXiv:2603.23580v1 Announce Type: new Abstract: Existing LLM-based Kubernetes diagnostic systems cannot learn from operational experience, operating on static knowledge bases without improving from past resolutions. We present MetaKube, an experience-aware LLM framework through three synergistic innovations: (1) an Episodic Pattern...

1 min 3 weeks, 5 days ago

ear

LOW Academic International

AI Generalisation Gap In Comorbid Sleep Disorder Staging

arXiv:2603.23582v1 Announce Type: new Abstract: Accurate sleep staging is essential for diagnosing OSA and hypopnea in stroke patients. Although PSG is reliable, it is costly, labor-intensive, and manually scored. While deep learning enables automated EEG-based sleep staging in healthy subjects,...

1 min 3 weeks, 5 days ago

ear

LOW Academic European Union

LineMVGNN: Anti-Money Laundering with Line-Graph-Assisted Multi-View Graph Neural Networks

arXiv:2603.23584v1 Announce Type: new Abstract: Anti-money laundering (AML) systems are important for protecting the global economy. However, conventional rule-based methods rely on domain knowledge, leading to suboptimal accuracy and a lack of scalability. Graph neural networks (GNNs) for digraphs (directed...

1 min 3 weeks, 5 days ago

ear

LOW Academic European Union

Steering Code LLMs with Activation Directions for Language and Library Control

arXiv:2603.23629v1 Announce Type: new Abstract: Code LLMs often default to particular programming languages and libraries under neutral prompts. We investigate whether these preferences are encoded as approximately linear directions in activation space that can be manipulated at inference time. Using...

1 min 3 weeks, 5 days ago

ear

LOW Academic United States

Boost Like a (Var)Pro: Trust-Region Gradient Boosting via Variable Projection

arXiv:2603.23658v1 Announce Type: new Abstract: Gradient boosting, a method of building additive ensembles from weak learners, has established itself as a practical and theoretically-motivated approach to approximate functions, especially using decision tree weak learners. Comparable methods for smooth parametric learners,...

1 min 3 weeks, 5 days ago

ear

LOW Academic International

CDMT-EHR: A Continuous-Time Diffusion Framework for Generating Mixed-Type Time-Series Electronic Health Records

arXiv:2603.23719v1 Announce Type: new Abstract: Electronic health records (EHRs) are invaluable for clinical research, yet privacy concerns severely restrict data sharing. Synthetic data generation offers a promising solution, but EHRs present unique challenges: they contain both numerical and categorical features...

1 min 3 weeks, 5 days ago

ear

LOW Academic International

BXRL: Behavior-Explainable Reinforcement Learning

arXiv:2603.23738v1 Announce Type: new Abstract: A major challenge of Reinforcement Learning is that agents often learn undesired behaviors that seem to defy the reward structure they were given. Explainable Reinforcement Learning (XRL) methods can answer queries such as "explain this...

1 min 3 weeks, 5 days ago

ear

LOW Academic International

Self Paced Gaussian Contextual Reinforcement Learning

arXiv:2603.23755v1 Announce Type: new Abstract: Curriculum learning improves reinforcement learning (RL) efficiency by sequencing tasks from simple to complex. However, many self-paced curriculum methods rely on computationally expensive inner-loop optimizations, limiting their scalability in high-dimensional context spaces. In this paper,...

1 min 3 weeks, 5 days ago

ear

LOW Academic United States

Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation Models

arXiv:2603.23783v1 Announce Type: new Abstract: Adapting large-scale foundation models to new domains with limited supervision remains a fundamental challenge due to latent distribution mismatch, unstable optimization dynamics, and miscalibrated uncertainty propagation. This paper introduces an uncertainty-aware probabilistic latent transport framework...

1 min 3 weeks, 5 days ago

ear

LOW Academic European Union

Latent Algorithmic Structure Precedes Grokking: A Mechanistic Study of ReLU MLPs on Modular Arithmetic

arXiv:2603.23784v1 Announce Type: new Abstract: Grokking-the phenomenon where validation accuracy of neural networks on modular addition of two integers rises long after training data has been memorized-has been characterized in previous works as producing sinusoidal input weight distributions in transformers...

1 min 3 weeks, 5 days ago

ear

LOW Academic International

Manifold Generalization Provably Proceeds Memorization in Diffusion Models

arXiv:2603.23792v1 Announce Type: new Abstract: Diffusion models often generate novel samples even when the learned score is only \emph{coarse} -- a phenomenon not accounted for by the standard view of diffusion training as density estimation. In this paper, we show...

1 min 3 weeks, 5 days ago

ear

LOW Academic European Union

Resolving gradient pathology in physics-informed epidemiological models

arXiv:2603.23799v1 Announce Type: new Abstract: Physics-informed neural networks (PINNs) are increasingly used in mathematical epidemiology to bridge the gap between noisy clinical data and compartmental models, such as the susceptible-exposed-infected-removed (SEIR) model. However, training these hybrid networks is often unstable...

1 min 3 weeks, 5 days ago

ear

LOW Academic European Union

Deep Neural Regression Collapse

arXiv:2603.23805v1 Announce Type: new Abstract: Neural Collapse is a phenomenon that helps identify sparse and low rank structures in deep classifiers. Recent work has extended the definition of neural collapse to regression problems, albeit only measuring the phenomenon at the...

1 min 3 weeks, 5 days ago

ear

LOW Academic United States

Circuit Complexity of Hierarchical Knowledge Tracing and Implications for Log-Precision Transformers

arXiv:2603.23823v1 Announce Type: new Abstract: Knowledge tracing models mastery over interconnected concepts, often organized by prerequisites. We analyze hierarchical prerequisite propagation through a circuit-complexity lens to clarify what is provable about transformer-style computation on deep concept hierarchies. Using recent results...

1 min 3 weeks, 5 days ago

ear

LOW Academic European Union

Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective

arXiv:2603.23831v1 Announce Type: new Abstract: Deep neural networks (DNNs), particularly those using Rectified Linear Unit (ReLU) activation functions, have achieved remarkable success across diverse machine learning tasks, including image recognition, audio processing, and language modeling. Despite this success, the non-convex...

1 min 3 weeks, 5 days ago

ear

LOW Academic European Union

Symbolic--KAN: Kolmogorov-Arnold Networks with Discrete Symbolic Structure for Interpretable Learning

arXiv:2603.23854v1 Announce Type: new Abstract: Symbolic discovery of governing equations is a long-standing goal in scientific machine learning, yet a fundamental trade-off persists between interpretability and scalable learning. Classical symbolic regression methods yield explicit analytic expressions but rely on combinatorial...

1 min 3 weeks, 5 days ago

ear

LOW Academic United States

An Invariant Compiler for Neural ODEs in AI-Accelerated Scientific Simulation

arXiv:2603.23861v1 Announce Type: new Abstract: Neural ODEs are increasingly used as continuous-time models for scientific and sensor data, but unconstrained neural ODEs can drift and violate domain invariants (e.g., conservation laws), yielding physically implausible solutions. In turn, this can compound...

1 min 3 weeks, 5 days ago

ear

LOW Academic European Union

Deep Convolutional Neural Networks for predicting highest priority functional group in organic molecules

arXiv:2603.23862v1 Announce Type: new Abstract: Our work addresses the problem of predicting the highest priority functional group present in an organic molecule. Functional Groups are groups of bound atoms that determine the physical and chemical properties of organic molecules. In...

1 min 3 weeks, 5 days ago

ear

LOW Academic European Union

Can VLMs Reason Robustly? A Neuro-Symbolic Investigation

arXiv:2603.23867v1 Announce Type: new Abstract: Vision-Language Models (VLMs) have been applied to a wide range of reasoning tasks, yet it remains unclear whether they can reason robustly under distribution shifts. In this paper, we study covariate shifts in which the...

1 min 3 weeks, 5 days ago

ear

LOW Academic International

HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation

arXiv:2603.23871v1 Announce Type: new Abstract: Large language models trained with reinforcement learning (RL) for mathematical reasoning face a fundamental challenge: on problems the model cannot solve at all - "cliff" prompts - the RL gradient vanishes entirely, preventing any learning...

1 min 3 weeks, 5 days ago

ear

LOW Academic United States

Off-Policy Safe Reinforcement Learning with Constrained Optimistic Exploration

arXiv:2603.23889v1 Announce Type: new Abstract: When safety is formulated as a limit of cumulative cost, safe reinforcement learning (RL) aims to learn policies that maximize return subject to the cost constraint in data collection and deployment. Off-policy safe RL methods,...

1 min 3 weeks, 5 days ago

ear

LOW Academic International

Optimal Variance-Dependent Regret Bounds for Infinite-Horizon MDPs

arXiv:2603.23926v1 Announce Type: new Abstract: Online reinforcement learning in infinite-horizon Markov decision processes (MDPs) remains less theoretically and algorithmically developed than its episodic counterpart, with many algorithms suffering from high ``burn-in'' costs and failing to adapt to benign instance-specific complexity....

1 min 3 weeks, 5 days ago

ear

LOW Academic International

GRMLR: Knowledge-Enhanced Small-Data Learning for Deep-Sea Cold Seep Stage Inference

arXiv:2603.23961v1 Announce Type: new Abstract: Deep-sea cold seep stage assessment has traditionally relied on costly, high-risk manned submersible operations and visual surveys of macrofauna. Although microbial communities provide a promising and more cost-effective alternative, reliable inference remains challenging because the...

1 min 3 weeks, 5 days ago

ear

LOW Academic European Union

Kirchhoff-Inspired Neural Networks for Evolving High-Order Perception

arXiv:2603.23977v1 Announce Type: new Abstract: Deep learning architectures are fundamentally inspired by neuroscience, particularly the structure of the brain's sensory pathways, and have achieved remarkable success in learning informative data representations. Although these architectures mimic the communication mechanisms of biological...

1 min 3 weeks, 5 days ago

ear

AscendOptimizer: Episodic Agent for Ascend NPU Operator Optimization

StateLinFormer: Stateful Training Enhancing Long-term Memory in Navigation

Dual-Criterion Curriculum Learning: Application to Temporal Data

PoiCGAN: A Targeted Poisoning Based on Feature-Label Joint Perturbation in Federated Learning

The Geometric Price of Discrete Logic: Context-driven Manifold Dynamics of Number Representations

Residual Attention Physics-Informed Neural Networks for Robust Multiphysics Simulation of Steady-State Electrothermal Energy Systems

MetaKube: An Experience-Aware LLM Framework for Kubernetes Failure Diagnosis

AI Generalisation Gap In Comorbid Sleep Disorder Staging

LineMVGNN: Anti-Money Laundering with Line-Graph-Assisted Multi-View Graph Neural Networks

Steering Code LLMs with Activation Directions for Language and Library Control

Boost Like a (Var)Pro: Trust-Region Gradient Boosting via Variable Projection

CDMT-EHR: A Continuous-Time Diffusion Framework for Generating Mixed-Type Time-Series Electronic Health Records

BXRL: Behavior-Explainable Reinforcement Learning

Self Paced Gaussian Contextual Reinforcement Learning

Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation Models

Latent Algorithmic Structure Precedes Grokking: A Mechanistic Study of ReLU MLPs on Modular Arithmetic

Manifold Generalization Provably Proceeds Memorization in Diffusion Models

Resolving gradient pathology in physics-informed epidemiological models

Deep Neural Regression Collapse

Circuit Complexity of Hierarchical Knowledge Tracing and Implications for Log-Precision Transformers

Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective

Symbolic--KAN: Kolmogorov-Arnold Networks with Discrete Symbolic Structure for Interpretable Learning

An Invariant Compiler for Neural ODEs in AI-Accelerated Scientific Simulation

Deep Convolutional Neural Networks for predicting highest priority functional group in organic molecules

Can VLMs Reason Robustly? A Neuro-Symbolic Investigation

HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation

Off-Policy Safe Reinforcement Learning with Constrained Optimistic Exploration

Optimal Variance-Dependent Regret Bounds for Infinite-Horizon MDPs

GRMLR: Knowledge-Enhanced Small-Data Learning for Deep-Sea Cold Seep Stage Inference

Kirchhoff-Inspired Neural Networks for Evolving High-Order Perception

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.