International Law

LOW Academic United States

Dual-Metric Evaluation of Social Bias in Large Language Models: Evidence from an Underrepresented Nepali Cultural Context

arXiv:2603.07792v1 Announce Type: new Abstract: Large language models (LLMs) increasingly influence global digital ecosystems, yet their potential to perpetuate social and cultural biases remains poorly understood in underrepresented contexts. This study presents a systematic analysis of representational biases in seven...

1 min 1 month, 1 week ago

ear

LOW Academic International

An Efficient and Effective Evaluator for Text2SQL Models on Unseen and Unlabeled Data

arXiv:2603.07841v1 Announce Type: new Abstract: Recent advances in large language models has strengthened Text2SQL systems that translate natural language questions into database queries. A persistent deployment challenge is to assess a newly trained Text2SQL system on an unseen and unlabeled...

1 min 1 month, 1 week ago

ear

LOW Academic International

How Attention Sinks Emerge in Large Language Models: An Interpretability Perspective

arXiv:2603.06591v1 Announce Type: new Abstract: Large Language Models (LLMs) often allocate disproportionate attention to specific tokens, a phenomenon commonly referred to as the attention sink. While such sinks are generally considered detrimental, prior studies have identified a notable exception: the...

1 min 1 month, 1 week ago

ear

LOW Academic European Union

Switchable Activation Networks

arXiv:2603.06601v1 Announce Type: new Abstract: Deep neural networks, and more recently large-scale generative models such as large language models (LLMs) and large vision-action models (LVAs), achieve remarkable performance across diverse domains, yet their prohibitive computational cost hinders deployment in resource-constrained...

1 min 1 month, 1 week ago

ear

LOW Academic International

Khatri-Rao Clustering for Data Summarization

arXiv:2603.06602v1 Announce Type: new Abstract: As datasets continue to grow in size and complexity, finding succinct yet accurate data summaries poses a key challenge. Centroid-based clustering, a widely adopted approach to address this challenge, finds informative summaries of datasets in...

1 min 1 month, 1 week ago

ear

LOW Academic United States

Scale Dependent Data Duplication

arXiv:2603.06603v1 Announce Type: new Abstract: Data duplication during pretraining can degrade generalization and lead to memorization, motivating aggressive deduplication pipelines. However, at web scale, it is unclear what constitutes a ``duplicate'': beyond surface-form matches, semantically equivalent documents (e.g. translations) may...

1 min 1 month, 1 week ago

ear

LOW Academic United States

Know When You're Wrong: Aligning Confidence with Correctness for LLM Error Detection

arXiv:2603.06604v1 Announce Type: new Abstract: As large language models (LLMs) are increasingly deployed in critical decision-making systems, the lack of reliable methods to measure their uncertainty presents a fundamental trustworthiness risk. We introduce a normalized confidence score based on output...

1 min 1 month, 1 week ago

ear

LOW Academic International

Structure-Aware Set Transformers: Temporal and Variable-Type Attention Biases for Asynchronous Clinical Time Series

arXiv:2603.06605v1 Announce Type: new Abstract: Electronic health records (EHR) are irregular, asynchronous multivariate time series. As time-series foundation models increasingly tokenize events rather than discretizing time, the input layout becomes a key design choice. Grids expose time$\times$variable structure but require...

1 min 1 month, 1 week ago

ear

LOW Academic International

Valid Feature-Level Inference for Tabular Foundation Models via the Conditional Randomization Test

arXiv:2603.06609v1 Announce Type: new Abstract: Modern machine learning models are highly expressive but notoriously difficult to analyze statistically. In particular, while black-box predictors can achieve strong empirical performance, they rarely provide valid hypothesis tests or p-values for assessing whether individual...

1 min 1 month, 1 week ago

ear

LOW Academic International

OptiRoulette Optimizer: A New Stochastic Meta-Optimizer for up to 5.3x Faster Convergence

arXiv:2603.06613v1 Announce Type: new Abstract: This paper presents OptiRoulette, a stochastic meta-optimizer that selects update rules during training instead of fixing a single optimizer. The method combines warmup optimizer locking, random sampling from an active optimizer pool, compatibility-aware learning-rate scaling...

1 min 1 month, 1 week ago

ear

LOW Academic European Union

Correlation Analysis of Generative Models

arXiv:2603.06614v1 Announce Type: new Abstract: Based on literature review about existing diffusion models and flow matching with a neural network to predict a predefined target from noisy data, a unified representation is first proposed for these models using two simple...

1 min 1 month, 1 week ago

ear

LOW Academic International

Annealed Co-Generation: Disentangling Variables via Progressive Pairwise Modeling

arXiv:2603.06615v1 Announce Type: new Abstract: For multivariate co-generation in scientific applications, we advocate pairwise block rather than joint modeling of all variables. This design mitigates the computational burden and data imbalance. To this end, we propose an Annealed Co-Generation (ACG)...

1 min 1 month, 1 week ago

ear

LOW Academic European Union

Distilling and Adapting: A Topology-Aware Framework for Zero-Shot Interaction Prediction in Multiplex Biological Networks

arXiv:2603.06618v1 Announce Type: new Abstract: Multiplex Biological Networks (MBNs), which represent multiple interaction types between entities, are crucial for understanding complex biological systems. Yet, existing methods often inadequately model multiplexity, struggle to integrate structural and sequence information, and face difficulties...

1 min 1 month, 1 week ago

ear

LOW Academic International

Not all tokens are needed(NAT): token efficient reinforcement learning

arXiv:2603.06619v1 Announce Type: new Abstract: Reinforcement learning (RL) has become a key driver of progress in large language models, but scaling RL to long chain-of-thought (CoT) trajectories is increasingly constrained by backpropagation over every generated token. Even with optimized rollout...

1 min 1 month, 1 week ago

ear

LOW Academic International

Reward Under Attack: Analyzing the Robustness and Hackability of Process Reward Models

arXiv:2603.06621v1 Announce Type: new Abstract: Process Reward Models (PRMs) are rapidly becoming the backbone of LLM reasoning pipelines, yet we demonstrate that state-of-the-art PRMs are systematically exploitable under adversarial optimization pressure. To address this, we introduce a three-tiered diagnostic framework...

1 min 1 month, 1 week ago

ear

LOW Academic International

From ARIMA to Attention: Power Load Forecasting Using Temporal Deep Learning

arXiv:2603.06622v1 Announce Type: new Abstract: Accurate short-term power load forecasting is important to effectively manage, optimize, and ensure the robustness of modern power systems. This paper performs an empirical evaluation of a traditional statistical model and deep learning approaches for...

1 min 1 month, 1 week ago

ear

LOW Academic United States

Advances in GRPO for Generation Models: A Survey

arXiv:2603.06623v1 Announce Type: new Abstract: Large-scale flow matching models have achieved strong performance across generative tasks such as text-to-image, video, 3D, and speech synthesis. However, aligning their outputs with human preferences and task-specific objectives remains challenging. Flow-GRPO extends Group Relative...

1 min 1 month, 1 week ago

ear

LOW Academic United States

Pavement Missing Condition Data Imputation through Collective Learning-Based Graph Neural Networks

arXiv:2603.06625v1 Announce Type: new Abstract: Pavement condition data is important in providing information regarding the current state of the road network and in determining the needs of maintenance and rehabilitation treatments. However, the condition data is often incomplete due to...

1 min 1 month, 1 week ago

ear

LOW Academic International

Grouter: Decoupling Routing from Representation for Accelerated MoE Training

arXiv:2603.06626v1 Announce Type: new Abstract: Traditional Mixture-of-Experts (MoE) training typically proceeds without any structural priors, effectively requiring the model to simultaneously train expert weights while searching for an optimal routing policy within a vast combinatorial space. This entanglement often leads...

1 min 1 month, 1 week ago

ear

LOW Academic International

A new Uncertainty Principle in Machine Learning

arXiv:2603.06634v1 Announce Type: new Abstract: Many scientific problems in the context of machine learning can be reduced to the search of polynomial answers in appropriate variables. The Hevisidization of arbitrary polynomial is actually provided by one-and-the same two-layer expression. What...

1 min 1 month, 1 week ago

ear

LOW Academic International

Graph Property Inference in Small Language Models: Effects of Representation and Inference Strategy

arXiv:2603.06635v1 Announce Type: new Abstract: Recent progress in language modeling has expanded the range of tasks that can be approached through natural language interfaces, including problems that require structured reasoning. However, it remains unclear how effectively limited-capacity language models can...

1 min 1 month, 1 week ago

ear

LOW Academic International

SmartBench: Evaluating LLMs in Smart Homes with Anomalous Device States and Behavioral Contexts

arXiv:2603.06636v1 Announce Type: new Abstract: Due to the strong context-awareness capabilities demonstrated by large language models (LLMs), recent research has begun exploring their integration into smart home assistants to help users manage and adjust their living environments. While LLMs have...

1 min 1 month, 1 week ago

ear

LOW Academic European Union

HEARTS: Benchmarking LLM Reasoning on Health Time Series

arXiv:2603.06638v1 Announce Type: new Abstract: The rise of large language models (LLMs) has shifted time series analysis from narrow analytics to general-purpose reasoning. Yet, existing benchmarks cover only a small set of health time series modalities and tasks, failing to...

1 min 1 month, 1 week ago

ear

LOW Academic International

SR-TTT: Surprisal-Aware Residual Test-Time Training

arXiv:2603.06642v1 Announce Type: new Abstract: Test-Time Training (TTT) language models achieve theoretically infinite context windows with an O(1) memory footprint by replacing the standard exact-attention KV-cache with hidden state ``fast weights'' W_fast updated via self-supervised learning during inference. However, pure...

1 min 1 month, 1 week ago

ear

LOW Academic United States

Trust Aware Federated Learning for Secure Bone Healing Stage Interpretation in e-Health

arXiv:2603.06646v1 Announce Type: new Abstract: This paper presents a trust aware federated learning (FL) framework for interpreting bone healing stages using spectral features derived from frequency response data. The primary objective is to address the challenge posed by either unreliable...

1 min 1 month, 1 week ago

ear

LOW Academic United States

HURRI-GAN: A Novel Approach for Hurricane Bias-Correction Beyond Gauge Stations using Generative Adversarial Networks

arXiv:2603.06649v1 Announce Type: new Abstract: The coastal regions of the eastern and southern United States are impacted by severe storm events, leading to significant loss of life and properties. Accurately forecasting storm surge and wind impacts from hurricanes is essential...

1 min 1 month, 1 week ago

ear

LOW Academic European Union

Geodesic Gradient Descent: A Generic and Learning-rate-free Optimizer on Objective Function-induced Manifolds

arXiv:2603.06651v1 Announce Type: new Abstract: Euclidean gradient descent algorithms barely capture the geometry of objective function-induced hypersurfaces and risk driving update trajectories off the hypersurfaces. Riemannian gradient descent algorithms address these issues but fail to represent complex hypersurfaces via a...

1 min 1 month, 1 week ago

ear

LOW Academic United States

ERP-RiskBench: Leakage-Safe Ensemble Learning for Financial Risk

arXiv:2603.06671v1 Announce Type: new Abstract: Financial risk detection in Enterprise Resource Planning (ERP) systems is an important but underexplored application of machine learning. Published studies in this area tend to suffer from vague dataset descriptions, leakage-prone pipelines, and evaluation practices...

1 min 1 month, 1 week ago

ear

LOW Academic United States

Scaling Agentic Capabilities, Not Context: Efficient Reinforcement Finetuning for Large Toolspaces

arXiv:2603.06713v1 Announce Type: new Abstract: Agentic systems operating over large tool ecosystems must plan and execute long-horizon workflows under weak or non-verifiable supervision. While frontier models mitigate these challenges through scale and large context budgets, small language models (SLMs) remain...

1 min 1 month, 1 week ago

ear

LOW Academic International

From Statistical Fidelity to Clinical Consistency: Scalable Generation and Auditing of Synthetic Patient Trajectories

arXiv:2603.06720v1 Announce Type: new Abstract: Access to electronic health records (EHRs) for digital health research is often limited by privacy regulations and institutional barriers. Synthetic EHRs have been proposed as a way to enable safe and sovereign data sharing; however,...

1 min 1 month, 1 week ago

ear

Dual-Metric Evaluation of Social Bias in Large Language Models: Evidence from an Underrepresented Nepali Cultural Context

An Efficient and Effective Evaluator for Text2SQL Models on Unseen and Unlabeled Data

How Attention Sinks Emerge in Large Language Models: An Interpretability Perspective

Switchable Activation Networks

Khatri-Rao Clustering for Data Summarization

Scale Dependent Data Duplication

Know When You're Wrong: Aligning Confidence with Correctness for LLM Error Detection

Structure-Aware Set Transformers: Temporal and Variable-Type Attention Biases for Asynchronous Clinical Time Series

Valid Feature-Level Inference for Tabular Foundation Models via the Conditional Randomization Test

OptiRoulette Optimizer: A New Stochastic Meta-Optimizer for up to 5.3x Faster Convergence

Correlation Analysis of Generative Models

Annealed Co-Generation: Disentangling Variables via Progressive Pairwise Modeling

Distilling and Adapting: A Topology-Aware Framework for Zero-Shot Interaction Prediction in Multiplex Biological Networks

Not all tokens are needed(NAT): token efficient reinforcement learning

Reward Under Attack: Analyzing the Robustness and Hackability of Process Reward Models

From ARIMA to Attention: Power Load Forecasting Using Temporal Deep Learning

Advances in GRPO for Generation Models: A Survey

Pavement Missing Condition Data Imputation through Collective Learning-Based Graph Neural Networks

Grouter: Decoupling Routing from Representation for Accelerated MoE Training

A new Uncertainty Principle in Machine Learning

Graph Property Inference in Small Language Models: Effects of Representation and Inference Strategy

SmartBench: Evaluating LLMs in Smart Homes with Anomalous Device States and Behavioral Contexts

HEARTS: Benchmarking LLM Reasoning on Health Time Series

SR-TTT: Surprisal-Aware Residual Test-Time Training

Trust Aware Federated Learning for Secure Bone Healing Stage Interpretation in e-Health

HURRI-GAN: A Novel Approach for Hurricane Bias-Correction Beyond Gauge Stations using Generative Adversarial Networks

Geodesic Gradient Descent: A Generic and Learning-rate-free Optimizer on Objective Function-induced Manifolds

ERP-RiskBench: Leakage-Safe Ensemble Learning for Financial Risk

Scaling Agentic Capabilities, Not Context: Efficient Reinforcement Finetuning for Large Toolspaces

From Statistical Fidelity to Clinical Consistency: Scalable Generation and Auditing of Synthetic Patient Trajectories

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.