Arbitration

LOW Academic International

Quantifying Memorization and Privacy Risks in Genomic Language Models

arXiv:2603.08913v1 Announce Type: new Abstract: Genomic language models (GLMs) have emerged as powerful tools for learning representations of DNA sequences, enabling advances in variant prediction, regulatory element identification, and cross-task transfer learning. However, as these models are increasingly trained or...

1 min 1 month, 1 week ago

bit

LOW Academic European Union

Uncovering a Winning Lottery Ticket with Continuously Relaxed Bernoulli Gates

arXiv:2603.08914v1 Announce Type: new Abstract: Over-parameterized neural networks incur prohibitive memory and computational costs for resource-constrained deployment. The Strong Lottery Ticket (SLT) hypothesis suggests that randomly initialized networks contain sparse subnetworks achieving competitive accuracy without weight training. Existing SLT methods,...

1 min 1 month, 1 week ago

bit

LOW Academic United States

The $qs$ Inequality: Quantifying the Double Penalty of Mixture-of-Experts at Inference

arXiv:2603.08960v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) models deliver high quality at low training FLOPs, but this efficiency often vanishes at inference. We identify a double penalty that structurally disadvantages MoE architectures during decoding: first, expert routing fragments microbatches and...

1 min 1 month, 1 week ago

adr

LOW Academic International

Two Teachers Better Than One: Hardware-Physics Co-Guided Distributed Scientific Machine Learning

arXiv:2603.09032v1 Announce Type: new Abstract: Scientific machine learning (SciML) is increasingly applied to in-field processing, controlling, and monitoring; however, wide-area sensing, real-time demands, and strict energy and reliability constraints make centralized SciML implementation impractical. Most SciML models assume raw data...

1 min 1 month, 1 week ago

bit

LOW Academic United States

Sim2Act: Robust Simulation-to-Decision Learning via Adversarial Calibration and Group-Relative Perturbation

arXiv:2603.09053v1 Announce Type: new Abstract: Simulation-to-decision learning enables safe policy training in digital environments without risking real-world deployment, and has become essential in mission-critical domains such as supply chains and industrial systems. However, simulators learned from noisy or biased real-world...

1 min 1 month, 1 week ago

bit

LOW Academic International

Overcoming Valid Action Suppression in Unmasked Policy Gradient Algorithms

arXiv:2603.09090v1 Announce Type: new Abstract: In reinforcement learning environments with state-dependent action validity, action masking consistently outperforms penalty-based handling of invalid actions, yet existing theory only shows that masking preserves the policy gradient theorem. We identify a distinct failure mode...

1 min 1 month, 1 week ago

bit

LOW Academic International

Better Bounds for the Distributed Experts Problem

arXiv:2603.09168v1 Announce Type: new Abstract: In this paper, we study the distributed experts problem, where $n$ experts are distributed across $s$ servers for $T$ timesteps. The loss of each expert at each time $t$ is the $\ell_p$ norm of the...

1 min 1 month, 1 week ago

bit

LOW Academic United States

The Radio-Frequency Transformer for Signal Separation

arXiv:2603.09201v1 Announce Type: new Abstract: We study a problem of signal separation: estimating a signal of interest (SOI) contaminated by an unknown non-Gaussian background/interference. Given the training data consisting of examples of SOI and interference, we show how to build...

1 min 1 month, 1 week ago

bit

LOW Academic International

TA-GGAD: Testing-time Adaptive Graph Model for Generalist Graph Anomaly Detection

arXiv:2603.09349v1 Announce Type: new Abstract: A significant number of anomalous nodes in the real world, such as fake news, noncompliant users, malicious transactions, and malicious posts, severely compromises the health of the graph data ecosystem and urgently requires effective identification...

1 min 1 month, 1 week ago

bit

LOW Academic European Union

Reforming the Mechanism: Editing Reasoning Patterns in LLMs with Circuit Reshaping

arXiv:2603.06923v1 Announce Type: new Abstract: Large language models (LLMs) often exhibit flawed reasoning ability that undermines reliability. Existing approaches to improving reasoning typically treat it as a general and monolithic skill, applying broad training which is inefficient and unable to...

1 min 1 month, 1 week ago

bit

LOW Academic International

A Coin Flip for Safety: LLM Judges Fail to Reliably Measure Adversarial Robustness

arXiv:2603.06594v1 Announce Type: new Abstract: Automated \enquote{LLM-as-a-Judge} frameworks have become the de facto standard for scalable evaluation across natural language processing. For instance, in safety evaluation, these judges are relied upon to evaluate harmfulness in order to benchmark the robustness...

1 min 1 month, 1 week ago

bit

LOW Academic United States

Language Shapes Mental Health Evaluations in Large Language Models

arXiv:2603.06910v1 Announce Type: new Abstract: This study investigates whether large language models (LLMs) exhibit cross-linguistic differences in mental health evaluations. Focusing on Chinese and English, we examine two widely used models, GPT-4o and Qwen3, to assess whether prompt language systematically...

1 min 1 month, 1 week ago

bit

LOW Academic International

Taiwan Safety Benchmark and Breeze Guard: Toward Trustworthy AI for Taiwanese Mandarin

arXiv:2603.07286v1 Announce Type: new Abstract: Global safety models exhibit strong performance across widely used benchmarks, yet their training data rarely captures the cultural and linguistic nuances of Taiwanese Mandarin. This limitation results in systematic blind spots when interpreting region-specific risks...

1 min 1 month, 1 week ago

bit

LOW Academic International

How Much Noise Can BERT Handle? Insights from Multilingual Sentence Difficulty Detection

arXiv:2603.07346v1 Announce Type: new Abstract: Noisy training data can significantly degrade the performance of language-model-based classifiers, particularly in non-topical classification tasks. In this study we designed a methodological framework to assess the impact of denoising. More specifically, we explored a...

1 min 1 month, 1 week ago

bit

LOW Academic United States

Dual-Metric Evaluation of Social Bias in Large Language Models: Evidence from an Underrepresented Nepali Cultural Context

arXiv:2603.07792v1 Announce Type: new Abstract: Large language models (LLMs) increasingly influence global digital ecosystems, yet their potential to perpetuate social and cultural biases remains poorly understood in underrepresented contexts. This study presents a systematic analysis of representational biases in seven...

1 min 1 month, 1 week ago

bit

LOW Academic European Union

Switchable Activation Networks

arXiv:2603.06601v1 Announce Type: new Abstract: Deep neural networks, and more recently large-scale generative models such as large language models (LLMs) and large vision-action models (LVAs), achieve remarkable performance across diverse domains, yet their prohibitive computational cost hinders deployment in resource-constrained...

1 min 1 month, 1 week ago

bit

LOW Academic International

Reward Under Attack: Analyzing the Robustness and Hackability of Process Reward Models

arXiv:2603.06621v1 Announce Type: new Abstract: Process Reward Models (PRMs) are rapidly becoming the backbone of LLM reasoning pipelines, yet we demonstrate that state-of-the-art PRMs are systematically exploitable under adversarial optimization pressure. To address this, we introduce a three-tiered diagnostic framework...

1 min 1 month, 1 week ago

bit

LOW Academic International

A new Uncertainty Principle in Machine Learning

arXiv:2603.06634v1 Announce Type: new Abstract: Many scientific problems in the context of machine learning can be reduced to the search of polynomial answers in appropriate variables. The Hevisidization of arbitrary polynomial is actually provided by one-and-the same two-layer expression. What...

1 min 1 month, 1 week ago

bit

LOW Academic European Union

Geodesic Gradient Descent: A Generic and Learning-rate-free Optimizer on Objective Function-induced Manifolds

arXiv:2603.06651v1 Announce Type: new Abstract: Euclidean gradient descent algorithms barely capture the geometry of objective function-induced hypersurfaces and risk driving update trajectories off the hypersurfaces. Riemannian gradient descent algorithms address these issues but fail to represent complex hypersurfaces via a...

1 min 1 month, 1 week ago

bit

LOW Academic International

Regression Models Meet Foundation Models: A Hybrid-AI Approach to Practical Electricity Price Forecasting

arXiv:2603.06726v1 Announce Type: new Abstract: Electricity market prices exhibit extreme volatility, nonlinearity, and non-stationarity, making accurate forecasting a significant challenge. While cutting-edge time series foundation models (TSFMs) effectively capture temporal dependencies, they typically underutilize cross-variate correlations and non-periodic patterns that...

1 min 1 month, 1 week ago

bit

LOW Academic International

Safe Transformer: An Explicit Safety Bit For Interpretable And Controllable Alignment

arXiv:2603.06727v1 Announce Type: new Abstract: Current safety alignment methods encode safe behavior implicitly within model parameters, creating a fundamental opacity: we cannot easily inspect why a model refuses a request, nor intervene when its safety judgments fail. We propose Safe...

1 min 1 month, 1 week ago

bit

LOW Academic European Union

Rank-Factorized Implicit Neural Bias: Scaling Super-Resolution Transformer with FlashAttention

arXiv:2603.06738v1 Announce Type: new Abstract: Recent Super-Resolution~(SR) methods mainly adopt Transformers for their strong long-range modeling capability and exceptional representational capacity. However, most SR Transformers rely heavily on relative positional bias~(RPB), which prevents them from leveraging hardware-efficient attention kernels such...

1 min 1 month, 1 week ago

bit

LOW Academic International

Improved Constrained Generation by Bridging Pretrained Generative Models

arXiv:2603.06742v1 Announce Type: new Abstract: Constrained generative modeling is fundamental to applications such as robotic control and autonomous driving, where models must respect physical laws and safety-critical constraints. In real-world settings, these constraints rarely take the form of simple linear...

1 min 1 month, 1 week ago

bit

LOW News United States

In birthright citizenship case, Justice Department urges court to treat an old concept in a new way

Immigration Matters is a recurring series by César Cuauhtémoc García Hernández that analyzes the court’s immigration docket, highlighting emerging legal questions about new policy and enforcement practices. President Donald Trump’s […]The postIn birthright citizenship case, Justice Department urges court to...

1 min 1 month, 1 week ago

enforcement

LOW News United States

SCOTUStoday for Monday, March 9

Just 22% of U.S. registered voters have “a great deal” (7%) or “quite a bit” (15%) of confidence in the Supreme Court, according to a new NBC News poll shared […]The postSCOTUStoday for Monday, March 9appeared first onSCOTUSblog.

1 min 1 month, 1 week ago

bit

LOW Academic International

Spatiotemporal Heterogeneity of AI-Driven Traffic Flow Patterns and Land Use Interaction: A GeoAI-Based Analysis of Multimodal Urban Mobility

arXiv:2603.05581v1 Announce Type: cross Abstract: Urban traffic flow is governed by the complex, nonlinear interaction between land use configuration and spatiotemporally heterogeneous mobility demand. Conventional global regression and time-series models cannot simultaneously capture these multi-scale dynamics across multiple travel modes....

1 min 1 month, 1 week ago

bit

LOW Academic International

Towards Efficient and Stable Ocean State Forecasting: A Continuous-Time Koopman Approach

arXiv:2603.05560v1 Announce Type: cross Abstract: We investigate the Continuous-Time Koopman Autoencoder (CT-KAE) as a lightweight surrogate model for long-horizon ocean state forecasting in a two-layer quasi-geostrophic (QG) system. By projecting nonlinear dynamics into a latent space governed by a linear...

1 min 1 month, 1 week ago

bit

LOW Academic International

Reasoning Models Struggle to Control their Chains of Thought

arXiv:2603.05706v1 Announce Type: new Abstract: Chain-of-thought (CoT) monitoring is a promising tool for detecting misbehaviors and understanding the motivations of modern reasoning models. However, if models can control what they verbalize in their CoT, it could undermine CoT monitorability. To...

1 min 1 month, 1 week ago

bit

LOW Academic International

On the Reliability of AI Methods in Drug Discovery: Evaluation of Boltz-2 for Structure and Binding Affinity Prediction

arXiv:2603.05532v1 Announce Type: cross Abstract: Despite continuing hype about the role of AI in drug discovery, no "AI-discovered drugs" have so far received regulatory approval. Here we assess one of the latest AI based tools in this domain. The ability...

1 min 1 month, 1 week ago

bit

LOW Academic International

An Embodied Companion for Visual Storytelling

arXiv:2603.05511v1 Announce Type: cross Abstract: As artificial intelligence shifts from pure tool for delegation toward agentic collaboration, its use in the arts can shift beyond the exploration of machine autonomy toward synergistic co-creation. While our earlier robotic works utilized automation...

1 min 1 month, 1 week ago

bit

Quantifying Memorization and Privacy Risks in Genomic Language Models

Uncovering a Winning Lottery Ticket with Continuously Relaxed Bernoulli Gates

The $qs$ Inequality: Quantifying the Double Penalty of Mixture-of-Experts at Inference

Two Teachers Better Than One: Hardware-Physics Co-Guided Distributed Scientific Machine Learning

Sim2Act: Robust Simulation-to-Decision Learning via Adversarial Calibration and Group-Relative Perturbation

Overcoming Valid Action Suppression in Unmasked Policy Gradient Algorithms

Better Bounds for the Distributed Experts Problem

The Radio-Frequency Transformer for Signal Separation

TA-GGAD: Testing-time Adaptive Graph Model for Generalist Graph Anomaly Detection

Reforming the Mechanism: Editing Reasoning Patterns in LLMs with Circuit Reshaping

A Coin Flip for Safety: LLM Judges Fail to Reliably Measure Adversarial Robustness

Language Shapes Mental Health Evaluations in Large Language Models

Taiwan Safety Benchmark and Breeze Guard: Toward Trustworthy AI for Taiwanese Mandarin

How Much Noise Can BERT Handle? Insights from Multilingual Sentence Difficulty Detection

Dual-Metric Evaluation of Social Bias in Large Language Models: Evidence from an Underrepresented Nepali Cultural Context

Switchable Activation Networks

Reward Under Attack: Analyzing the Robustness and Hackability of Process Reward Models

A new Uncertainty Principle in Machine Learning

Geodesic Gradient Descent: A Generic and Learning-rate-free Optimizer on Objective Function-induced Manifolds

Regression Models Meet Foundation Models: A Hybrid-AI Approach to Practical Electricity Price Forecasting

Safe Transformer: An Explicit Safety Bit For Interpretable And Controllable Alignment

Rank-Factorized Implicit Neural Bias: Scaling Super-Resolution Transformer with FlashAttention

Improved Constrained Generation by Bridging Pretrained Generative Models

In birthright citizenship case, Justice Department urges court to treat an old concept in a new way

SCOTUStoday for Monday, March 9

Spatiotemporal Heterogeneity of AI-Driven Traffic Flow Patterns and Land Use Interaction: A GeoAI-Based Analysis of Multimodal Urban Mobility

Towards Efficient and Stable Ocean State Forecasting: A Continuous-Time Koopman Approach

Reasoning Models Struggle to Control their Chains of Thought

On the Reliability of AI Methods in Drug Discovery: Evaluation of Boltz-2 for Structure and Binding Affinity Prediction

An Embodied Companion for Visual Storytelling

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.