Arbitration

LOW Academic European Union

Attn-QAT: 4-Bit Attention With Quantization-Aware Training

arXiv:2603.00040v1 Announce Type: new Abstract: Achieving reliable 4-bit attention is a prerequisite for end-to-end FP4 computation on emerging FP4-capable GPUs, yet attention remains the main obstacle due to FP4's tiny dynamic range and attention's heavy-tailed activations. This paper presents the...

1 min 1 month, 2 weeks ago

bit

LOW Academic International

Maximizing the Spectral Energy Gain in Sub-1-Bit LLMs via Latent Geometry Alignment

arXiv:2603.00042v1 Announce Type: new Abstract: We identify the Spectral Energy Gain in extreme model compression, where low-rank binary approximations outperform tiny-rank floating-point baselines for heavy-tailed spectra. However, prior attempts fail to realize this potential, trailing state-of-the-art 1-bit methods. We attribute...

1 min 1 month, 2 weeks ago

bit

LOW Academic International

Breaking the Factorization Barrier in Diffusion Language Models

arXiv:2603.00045v1 Announce Type: new Abstract: Diffusion language models theoretically allow for efficient parallel generation but are practically hindered by the "factorization barrier": the assumption that simultaneously predicted tokens are independent. This limitation forces a trade-off: models must either sacrifice speed...

1 min 1 month, 2 weeks ago

bit

LOW Academic International

REMIND: Rethinking Medical High-Modality Learning under Missingness--A Long-Tailed Distribution Perspective

arXiv:2603.00046v1 Announce Type: new Abstract: Medical multi-modal learning is critical for integrating information from a large set of diverse modalities. However, when leveraging a high number of modalities in real clinical applications, it is often impractical to obtain full-modality observations...

1 min 1 month, 2 weeks ago

bit

LOW Academic United Kingdom

Knowledge-guided generative surrogate modeling for high-dimensional design optimization under scarce data

arXiv:2603.00052v1 Announce Type: new Abstract: Surrogate models are widely used in mechanical design and manufacturing process optimization, where high-fidelity computational models may be unavailable or prohibitively expensive. Their effectiveness, however, is often limited by data scarcity, as purely data-driven surrogates...

1 min 1 month, 2 weeks ago

bit

LOW Academic International

Mag-Mamba: Modeling Coupled spatiotemporal Asymmetry for POI Recommendation

arXiv:2603.00053v1 Announce Type: new Abstract: Next Point-of-Interest (POI) recommendation is a critical task in location-based services, yet it faces the fundamental challenge of coupled spatiotemporal asymmetry inherent in urban mobility. Specifically, transition intents between locations exhibit high asymmetry and are...

1 min 1 month, 2 weeks ago

bit

LOW Academic International

Expert Divergence Learning for MoE-based Language Models

arXiv:2603.00054v1 Announce Type: new Abstract: The Mixture-of-Experts (MoE) architecture is a powerful technique for scaling language models, yet it often suffers from expert homogenization, where experts learn redundant functionalities, thereby limiting MoE's full potential. To address this, we introduce Expert...

1 min 1 month, 2 weeks ago

bit

LOW Academic European Union

Wideband Power Amplifier Behavioral Modeling Using an Amplitude Conditioned LSTM

arXiv:2603.00101v1 Announce Type: new Abstract: Wideband power amplifiers exhibit complex nonlinear and memory effects that challenge traditional behavioral modeling approaches. This paper proposes a novel amplitude conditioned long short-term memory (AC-LSTM) network that introduces explicit amplitude-dependent gating to enhance the...

1 min 1 month, 2 weeks ago

bit

LOW Academic International

MAML-KT: Addressing Cold Start Problem in Knowledge Tracing for New Students via Few-Shot Model-Agnostic Meta Learning

arXiv:2603.00137v1 Announce Type: new Abstract: Knowledge tracing (KT) models are commonly evaluated by training on early interactions from all students and testing on later responses. While effective for measuring average predictive performance, this evaluation design obscures a cold start scenario...

1 min 1 month, 2 weeks ago

bit

LOW Academic European Union

Diagnostics for Individual-Level Prediction Instability in Machine Learning for Healthcare

arXiv:2603.00192v1 Announce Type: new Abstract: In healthcare, predictive models increasingly inform patient-level decisions, yet little attention is paid to the variability in individual risk estimates and its impact on treatment decisions. For overparameterized models, now standard in machine learning, a...

1 min 1 month, 2 weeks ago

bit

LOW Academic European Union

Scalable Gaussian process modeling of parametrized spatio-temporal fields

arXiv:2603.00290v1 Announce Type: new Abstract: We introduce a scalable Gaussian process (GP) framework with deep product kernels for data-driven learning of parametrized spatio-temporal fields over fixed or parameter-dependent domains. The proposed framework learns a continuous representation, enabling predictions at arbitrary...

1 min 1 month, 2 weeks ago

bit

LOW Academic European Union

Polynomial Surrogate Training for Differentiable Ternary Logic Gate Networks

arXiv:2603.00302v1 Announce Type: new Abstract: Differentiable logic gate networks (DLGNs) learn compact, interpretable Boolean circuits via gradient-based training, but all existing variants are restricted to the 16 two-input binary gates. Extending DLGNs to Ternary Kleene $K_3$ logic and training DTLGNs...

1 min 1 month, 2 weeks ago

adr

LOW Academic International

Physics-Aware Learnability: From Set-Theoretic Independence to Operational Constraints

arXiv:2603.00417v1 Announce Type: new Abstract: Beyond binary classification, learnability can become a logically fragile notion: in EMX, even the class of all finite subsets of $[0,1]$ is learnable in some models of ZFC and not in others. We argue the...

1 min 1 month, 2 weeks ago

bit

LOW Conference International

2026 Expo Schedule

1 min 1 month, 2 weeks ago

bit

LOW Academic International

LLM-Driven Multi-Turn Task-Oriented Dialogue Synthesis for Realistic Reasoning

arXiv:2602.23610v1 Announce Type: new Abstract: The reasoning capability of large language models (LLMs), defined as their ability to analyze, infer, and make decisions based on input information, is essential for building intelligent task-oriented dialogue systems. However, existing benchmarks do not...

1 min 1 month, 2 weeks ago

bit

LOW Academic International

MT-PingEval: Evaluating Multi-Turn Collaboration with Private Information Games

arXiv:2602.24188v1 Announce Type: new Abstract: We present a scalable methodology for evaluating language models in multi-turn interactions, using a suite of collaborative games that require effective communication about private information. This enables an interactive scaling analysis, in which a fixed...

1 min 1 month, 2 weeks ago

adr

LOW Academic International

HiDrop: Hierarchical Vision Token Reduction in MLLMs via Late Injection, Concave Pyramid Pruning, and Early Exit

arXiv:2602.23699v1 Announce Type: cross Abstract: The quadratic computational cost of processing vision tokens in Multimodal Large Language Models (MLLMs) hinders their widespread adoption. While progressive vision token pruning offers a promising solution, current methods misinterpret shallow layer functions and use...

1 min 1 month, 2 weeks ago

adr

LOW Academic European Union

Neural Operators Can Discover Functional Clusters

arXiv:2602.23528v1 Announce Type: new Abstract: Operator learning is reshaping scientific computing by amortizing inference across infinite families of problems. While neural operators (NOs) are increasingly well understood for regression, far less is known for classification and its unsupervised analogue: clustering....

1 min 1 month, 2 weeks ago

bit

LOW Academic European Union

Rudder: Steering Prefetching in Distributed GNN Training using LLM Agents

arXiv:2602.23556v1 Announce Type: new Abstract: Large-scale Graph Neural Networks (GNNs) are typically trained by sampling a vertex's neighbors to a fixed distance. Because large input graphs are distributed, training requires frequent irregular communication that stalls forward progress. Moreover, fetched data...

1 min 1 month, 2 weeks ago

bit

LOW Academic International

Dynamics of Learning under User Choice: Overspecialization and Peer-Model Probing

arXiv:2602.23565v1 Announce Type: new Abstract: In many economically relevant contexts where machine learning is deployed, multiple platforms obtain data from the same pool of users, each of whom selects the platform that best serves them. Prior work in this setting...

1 min 1 month, 2 weeks ago

bit

LOW Academic European Union

Normalisation and Initialisation Strategies for Graph Neural Networks in Blockchain Anomaly Detection

arXiv:2602.23599v1 Announce Type: new Abstract: Graph neural networks (GNNs) offer a principled approach to financial fraud detection by jointly learning from node features and transaction graph topology. However, their effectiveness on real-world anti-money laundering (AML) benchmarks depends critically on training...

1 min 1 month, 2 weeks ago

bit

LOW Academic International

FlexGuard: Continuous Risk Scoring for Strictness-Adaptive LLM Content Moderation

arXiv:2602.23636v1 Announce Type: new Abstract: Ensuring the safety of LLM-generated content is essential for real-world deployment. Most existing guardrail models formulate moderation as a fixed binary classification task, implicitly assuming a fixed definition of harmfulness. In practice, enforcement strictness -...

1 min 1 month, 2 weeks ago

enforcement

LOW Academic International

Optimizer-Induced Low-Dimensional Drift and Transverse Dynamics in Transformer Training

arXiv:2602.23696v1 Announce Type: new Abstract: We study the geometry of training trajectories in small transformer models and find that parameter updates organize into a dominant drift direction with transverse residual dynamics. Using uncentered, row-normalized trajectory PCA, we show that a...

1 min 1 month, 2 weeks ago

bit

LOW Academic United States

MPU: Towards Secure and Privacy-Preserving Knowledge Unlearning for Large Language Models

arXiv:2602.23798v1 Announce Type: new Abstract: Machine unlearning for large language models often faces a privacy dilemma in which strict constraints prohibit sharing either the server's parameters or the client's forget set. To address this dual non-disclosure constraint, we propose MPU,...

1 min 1 month, 2 weeks ago

bit

LOW Academic European Union

Intrinsic Lorentz Neural Network

arXiv:2602.23981v1 Announce Type: new Abstract: Real-world data frequently exhibit latent hierarchical structures, which can be naturally represented by hyperbolic geometry. Although recent hyperbolic neural networks have demonstrated promising results, many existing architectures remain partially intrinsic, mixing Euclidean operations with hyperbolic...

1 min 1 month, 2 weeks ago

bit

LOW News United States

Court sides with parents in dispute over California policies on transgender students

The Supreme Court on Monday night granted a request from a group of California parents to reinstate a ruling by a federal district court that prohibits schools in that state […]

1 min 1 month, 2 weeks ago

bit

LOW News United States

Supreme Court skeptical of law banning drug users from possessing firearms

The Supreme Court on Monday was skeptical that the indictment of a Texas man on charges that he violated a federal law prohibiting the possession of a gun by the […]

1 min 1 month, 2 weeks ago

bit

LOW News United States

Trump FCC's equal-time crackdown doesn't apply equally—or at all—to talk radio

FCC Chairman Brendan Carr's unequal enforcement of the equal-time rule.

1 min 1 month, 2 weeks ago

enforcement

LOW Law Review United States

Right Diagnosis, Wrong Cure: Reconceptualizing the Commerce Clause Basis for the Federal Prohibition on Felon Firearm Possession

Introduction Jonathan Adler recently posted the provocative piece: “Is the Federal Prohibition on Felon Firearm Possession Constitutional?”[1] Although Second Amendment challenges are all the rage, Adler instead asks about Congress’s commerce power. This Essay takes up Adler’s challenge to reconceptualize...

1 min 1 month, 2 weeks ago

bit

LOW Academic International

PreScience: A Benchmark for Forecasting Scientific Contributions

arXiv:2602.20459v1 Announce Type: new Abstract: Can AI systems trained on the scientific record up to a fixed point in time forecast the scientific advances that follow? Such a capability could help researchers identify collaborators and impactful research directions, and anticipate...

1 min 1 month, 2 weeks ago

adr
