Arbitration

LOW Academic United States

Genetic Generalized Additive Models

arXiv:2602.15877v1 Announce Type: cross Abstract: Generalized Additive Models (GAMs) balance predictive accuracy and interpretability, but manually configuring their structure is challenging. We propose using the multi-objective genetic algorithm NSGA-II to automatically optimize GAMs, jointly minimizing prediction error (RMSE) and a...

1 min 1 month, 4 weeks ago

bit

LOW Academic International

IT-OSE: Exploring Optimal Sample Size for Industrial Data Augmentation

arXiv:2602.15878v1 Announce Type: cross Abstract: In industrial scenarios, data augmentation is an effective approach to improve model performance. However, its benefits are not unidirectionally beneficial. There is no theoretical research or established estimation for the optimal sample size (OSS) in...

1 min 1 month, 4 weeks ago

bit

LOW Academic International

FUTURE-VLA: Forecasting Unified Trajectories Under Real-time Execution

arXiv:2602.15882v1 Announce Type: cross Abstract: General vision-language models increasingly support unified spatiotemporal reasoning over long video streams, yet deploying such capabilities on robots remains constrained by the prohibitive latency of processing long-horizon histories and generating high-dimensional future predictions. To bridge...

1 min 1 month, 4 weeks ago

bit

LOW Academic European Union

NeuroSleep: Neuromorphic Event-Driven Single-Channel EEG Sleep Staging for Edge-Efficient Sensing

arXiv:2602.15888v1 Announce Type: cross Abstract: Reliable, continuous neural sensing on wearable edge platforms is fundamental to long-term health monitoring; however, for electroencephalography (EEG)-based sleep monitoring, dense high-frequency processing is often computationally prohibitive under tight energy budgets. To address this bottleneck,...

1 min 1 month, 4 weeks ago

bit

LOW Academic International

Doc-to-LoRA: Learning to Instantly Internalize Contexts

arXiv:2602.15902v1 Announce Type: cross Abstract: Long input sequences are central to in-context learning, document understanding, and multi-step reasoning of Large Language Models (LLMs). However, the quadratic attention cost of Transformers makes inference memory-intensive and slow. While context distillation (CD) can...

1 min 1 month, 4 weeks ago

adr

LOW Academic International

RFEval: Benchmarking Reasoning Faithfulness under Counterfactual Reasoning Intervention in Large Reasoning Models

arXiv:2602.17053v1 Announce Type: new Abstract: Large Reasoning Models (LRMs) exhibit strong performance, yet often produce rationales that sound plausible but fail to reflect their true decision process, undermining reliability and trust. We introduce a formal framework for reasoning faithfulness, defined...

1 min 1 month, 4 weeks ago

bit

LOW Academic International

How AI Coding Agents Communicate: A Study of Pull Request Description Characteristics and Human Review Responses

arXiv:2602.17084v1 Announce Type: new Abstract: The rapid adoption of large language models has led to the emergence of AI coding agents that autonomously create pull requests on GitHub. However, how these agents differ in their pull request description characteristics, and...

1 min 1 month, 4 weeks ago

bit

LOW Academic International

Owen-based Semantics and Hierarchy-Aware Explanation (O-Shap)

arXiv:2602.17107v1 Announce Type: new Abstract: Shapley value-based methods have become foundational in explainable artificial intelligence (XAI), offering theoretically grounded feature attributions through cooperative game theory. However, in practice, particularly in vision tasks, the assumption of feature independence breaks down, as...

1 min 1 month, 4 weeks ago

bit

LOW Academic United States

Decoding the Human Factor: High Fidelity Behavioral Prediction for Strategic Foresight

arXiv:2602.17222v1 Announce Type: new Abstract: Predicting human decision-making in high-stakes environments remains a central challenge for artificial intelligence. While large language models (LLMs) demonstrate strong general reasoning, they often struggle to generate consistent, individual-specific behavior, particularly when accurate prediction depends...

1 min 1 month, 4 weeks ago

bit

LOW Academic United States

One-step Language Modeling via Continuous Denoising

arXiv:2602.16813v1 Announce Type: new Abstract: Language models based on discrete diffusion have attracted widespread interest for their potential to provide faster generation than autoregressive models. In practice, however, they exhibit a sharp degradation of sample quality in the few-step regime,...

1 min 1 month, 4 weeks ago

bit

LOW Academic United States

BankMathBench: A Benchmark for Numerical Reasoning in Banking Scenarios

arXiv:2602.17072v1 Announce Type: new Abstract: Large language models (LLMs)-based chatbots are increasingly being adopted in the financial domain, particularly in digital banking, to handle customer inquiries about products such as deposits, savings, and loans. However, these models still exhibit low...

1 min 1 month, 4 weeks ago

bit

LOW Academic International

Towards Cross-lingual Values Assessment: A Consensus-Pluralism Perspective

arXiv:2602.17283v1 Announce Type: new Abstract: While large language models (LLMs) have become pivotal to content safety, current evaluation paradigms primarily focus on detecting explicit harms (e.g., violence or hate speech), neglecting the subtler value dimensions conveyed in digital content. To...

1 min 1 month, 4 weeks ago

bit

LOW Academic European Union

Representation Collapse in Machine Translation Through the Lens of Angular Dispersion

arXiv:2602.17287v1 Announce Type: new Abstract: Modern neural translation models based on the Transformer architecture are known for their high performance, particularly when trained on high-resource datasets. A standard next-token prediction training strategy, while widely adopted in practice, may lead to...

1 min 1 month, 4 weeks ago

bit

LOW Conference United Kingdom

ICLR 2026 Program Committee

12 min 1 month, 4 weeks ago

adr

LOW Academic International

ABCD: All Biases Come Disguised

arXiv:2602.17445v1 Announce Type: new Abstract: Multiple-choice question (MCQ) benchmarks have been a standard evaluation practice for measuring LLMs' ability to reason and answer knowledge-based questions. Through a synthetic NonsenseQA benchmark, we observe that different LLMs exhibit varying degrees of label-position-few-shot-prompt...

1 min 1 month, 4 weeks ago

bit

LOW Academic European Union

Entropy-Based Data Selection for Language Models

arXiv:2602.17465v1 Announce Type: new Abstract: Modern language models (LMs) increasingly require two critical resources: computational resources and data resources. Data selection techniques can effectively reduce the amount of training data required for fine-tuning LMs. However, their effectiveness is closely related...

1 min 1 month, 4 weeks ago

bit

LOW Academic International

Auditing Reciprocal Sentiment Alignment: Inversion Risk, Dialect Representation and Intent Misalignment in Transformers

arXiv:2602.17469v1 Announce Type: new Abstract: The core theme of bidirectional alignment is ensuring that AI systems accurately understand human intent and that humans can trust AI behavior. However, this loop fractures significantly across language barriers. Our research addresses Cross-Lingual Sentiment...

1 min 1 month, 4 weeks ago

bit

LOW Academic International

Unmasking the Factual-Conceptual Gap in Persian Language Models

arXiv:2602.17623v1 Announce Type: new Abstract: While emerging Persian NLP benchmarks have expanded into pragmatics and politeness, they rarely distinguish between memorized cultural facts and the ability to reason about implicit social norms. We introduce DivanBench, a diagnostic benchmark focused on...

1 min 1 month, 4 weeks ago

bit

LOW Academic International

Differences in Typological Alignment in Language Models' Treatment of Differential Argument Marking

arXiv:2602.17653v1 Announce Type: new Abstract: Recent work has shown that language models (LMs) trained on synthetic corpora can exhibit typological preferences that resemble cross-linguistic regularities in human languages, particularly for syntactic phenomena such as word order. In this paper, we...

1 min 1 month, 4 weeks ago

bit

LOW Academic European Union

Sink-Aware Pruning for Diffusion Language Models

arXiv:2602.17664v1 Announce Type: new Abstract: Diffusion Language Models (DLMs) incur high inference cost due to iterative denoising, motivating efficient pruning. Existing pruning heuristics largely inherited from autoregressive (AR) LLMs, typically preserve attention sink tokens because AR sinks serve as stable...

1 min 1 month, 4 weeks ago

bit

LOW Academic United States

Omitted Variable Bias in Language Models Under Distribution Shift

arXiv:2602.16784v1 Announce Type: cross Abstract: Despite their impressive performance on a wide variety of tasks, modern language models remain susceptible to distribution shifts, exhibiting brittle behavior when evaluated on data that differs in distribution from their training data. In this...

1 min 1 month, 4 weeks ago

bit

LOW Academic International

Quantifying LLM Attention-Head Stability: Implications for Circuit Universality

arXiv:2602.16740v1 Announce Type: new Abstract: In mechanistic interpretability, recent work scrutinizes transformer "circuits" - sparse, mono or multi layer sub computations, that may reflect human understandable functions. Yet, these network circuits are rarely acid-tested for their stability across different instances...

1 min 1 month, 4 weeks ago

bit

LOW Academic European Union

Machine Learning Argument of Latitude Error Model for LEO Satellite Orbit and Covariance Correction

arXiv:2602.16764v1 Announce Type: new Abstract: Low Earth orbit (LEO) satellites are leveraged to support new position, navigation, and timing (PNT) service alternatives to GNSS. These alternatives require accurate propagation of satellite position and velocity with a realistic quantification of uncertainty....

1 min 1 month, 4 weeks ago

bit

LOW Academic International

Escaping the Cognitive Well: Efficient Competition Math with Off-the-Shelf Models

arXiv:2602.16793v1 Announce Type: new Abstract: In the past year, custom and unreleased math reasoning models reached gold medal performance on the International Mathematical Olympiad (IMO). Similar performance was then reported using large-scale inference on publicly available models but at prohibitive...

1 min 1 month, 4 weeks ago

bit

LOW Academic European Union

Exact Certification of Data-Poisoning Attacks Using Mixed-Integer Programming

arXiv:2602.16944v1 Announce Type: new Abstract: This work introduces a verification framework that provides both sound and complete guarantees for data poisoning attacks during neural network training. We formulate adversarial data manipulation, model training, and test-time evaluation in a single mixed-integer...

1 min 1 month, 4 weeks ago

adr

LOW Academic International

Discovering Universal Activation Directions for PII Leakage in Language Models

arXiv:2602.16980v1 Announce Type: new Abstract: Modern language models exhibit rich internal structure, yet little is known about how privacy-sensitive behaviors, such as personally identifiable information (PII) leakage, are represented and modulated within their hidden states. We present UniLeak, a mechanistic-interpretability...

1 min 1 month, 4 weeks ago

bit

LOW Academic International

Sign Lock-In: Randomly Initialized Weight Signs Persist and Bottleneck Sub-Bit Model Compression

arXiv:2602.17063v1 Announce Type: new Abstract: Sub-bit model compression seeks storage below one bit per weight; as magnitudes are aggressively compressed, the sign bit becomes a fixed-cost bottleneck. Across Transformers, CNNs, and MLPs, learned sign matrices resist low-rank approximation and are...

1 min 1 month, 4 weeks ago

bit

LOW News United States

The creator economy’s ad revenue problem and India’s AI ambitions

The creator economy is evolving fast, and ad revenue alone isn’t cutting it anymore. YouTubers are launching product lines, acquiring startups, and building actual business empires. In fact, MrBeast’s company bought fintech startup Step, and his chocolate business is outearning...

1 min 1 month, 4 weeks ago

bit

LOW Academic International

Multi-source Heterogeneous Public Opinion Analysis via Collaborative Reasoning and Adaptive Fusion: A Systematically Integrated Approach

arXiv:2602.15857v1 Announce Type: new Abstract: The analysis of public opinion from multiple heterogeneous sources presents significant challenges due to structural differences, semantic variations, and platform-specific biases. This paper introduces a novel Collaborative Reasoning and Adaptive Fusion (CRAF) framework that systematically...

1 min 1 month, 4 weeks ago

bit

LOW Academic South Korea

From Transcripts to AI Agents: Knowledge Extraction, RAG Integration, and Robust Evaluation of Conversational AI Assistants

arXiv:2602.15859v1 Announce Type: new Abstract: Building reliable conversational AI assistants for customer-facing industries remains challenging due to noisy conversational data, fragmented knowledge, and the requirement for accurate human hand-off - particularly in domains that depend heavily on real-time information. This...

1 min 1 month, 4 weeks ago

bit

Genetic Generalized Additive Models

IT-OSE: Exploring Optimal Sample Size for Industrial Data Augmentation

FUTURE-VLA: Forecasting Unified Trajectories Under Real-time Execution

NeuroSleep: Neuromorphic Event-Driven Single-Channel EEG Sleep Staging for Edge-Efficient Sensing

Doc-to-LoRA: Learning to Instantly Internalize Contexts

RFEval: Benchmarking Reasoning Faithfulness under Counterfactual Reasoning Intervention in Large Reasoning Models

How AI Coding Agents Communicate: A Study of Pull Request Description Characteristics and Human Review Responses

Owen-based Semantics and Hierarchy-Aware Explanation (O-Shap)

Decoding the Human Factor: High Fidelity Behavioral Prediction for Strategic Foresight

One-step Language Modeling via Continuous Denoising

BankMathBench: A Benchmark for Numerical Reasoning in Banking Scenarios

Towards Cross-lingual Values Assessment: A Consensus-Pluralism Perspective

Representation Collapse in Machine Translation Through the Lens of Angular Dispersion

ICLR 2026 Program Committee

ABCD: All Biases Come Disguised

Entropy-Based Data Selection for Language Models

Auditing Reciprocal Sentiment Alignment: Inversion Risk, Dialect Representation and Intent Misalignment in Transformers

Unmasking the Factual-Conceptual Gap in Persian Language Models

Differences in Typological Alignment in Language Models' Treatment of Differential Argument Marking

Sink-Aware Pruning for Diffusion Language Models

Omitted Variable Bias in Language Models Under Distribution Shift

Quantifying LLM Attention-Head Stability: Implications for Circuit Universality

Machine Learning Argument of Latitude Error Model for LEO Satellite Orbit and Covariance Correction

Escaping the Cognitive Well: Efficient Competition Math with Off-the-Shelf Models

Exact Certification of Data-Poisoning Attacks Using Mixed-Integer Programming

Discovering Universal Activation Directions for PII Leakage in Language Models

Sign Lock-In: Randomly Initialized Weight Signs Persist and Bottleneck Sub-Bit Model Compression

The creator economy’s ad revenue problem and India’s AI ambitions

Multi-source Heterogeneous Public Opinion Analysis via Collaborative Reasoning and Adaptive Fusion: A Systematically Integrated Approach

From Transcripts to AI Agents: Knowledge Extraction, RAG Integration, and Robust Evaluation of Conversational AI Assistants

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.