Arbitration

LOW Academic International

VIRAASAT: Traversing Novel Paths for Indian Cultural Reasoning

arXiv:2602.18429v1 Announce Type: new Abstract: Large Language Models (LLMs) have made significant progress in reasoning tasks across various domains such as mathematics and coding. However, their performance deteriorates in tasks requiring rich socio-cultural knowledge and diverse local contexts, particularly those...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

LATMiX: Learnable Affine Transformations for Microscaling Quantization of LLMs

arXiv:2602.17681v1 Announce Type: cross Abstract: Post-training quantization (PTQ) is a widely used approach for reducing the memory and compute costs of large language models (LLMs). Recent studies have shown that applying invertible transformations to activations can significantly improve quantization robustness...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

Tethered Reasoning: Decoupling Entropy from Hallucination in Quantized LLMs via Manifold Steering

arXiv:2602.17691v1 Announce Type: cross Abstract: Quantized language models face a fundamental dilemma: low sampling temperatures yield repetitive, mode-collapsed outputs, while high temperatures (T > 2.0) cause trajectory divergence and semantic incoherence. We present HELIX, a geometric framework that decouples output...

1 min 1 month, 3 weeks ago

bit

LOW Academic United States

TFL: Targeted Bit-Flip Attack on Large Language Model

arXiv:2602.17837v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly deployed in safety and security critical applications, raising concerns about their robustness to model parameter fault injection attacks. Recent studies have shown that bit-flip attacks (BFAs), which exploit computer...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

Analyzing and Improving Chain-of-Thought Monitorability Through Information Theory

arXiv:2602.18297v1 Announce Type: cross Abstract: Chain-of-thought (CoT) monitors are LLM-based systems that analyze reasoning traces to detect when outputs may exhibit attributes of interest, such as test-hacking behavior during code generation. In this paper, we use information-theoretic analysis to show...

1 min 1 month, 3 weeks ago

bit

LOW Academic European Union

On the "Induction Bias" in Sequence Models

arXiv:2602.18333v1 Announce Type: cross Abstract: Despite the remarkable practical success of transformer-based language models, recent work has raised concerns about their ability to perform state tracking. In particular, a growing body of literature has shown this limitation primarily through failures...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

BioBridge: Bridging Proteins and Language for Enhanced Biological Reasoning with LLMs

arXiv:2602.17680v1 Announce Type: new Abstract: Existing Protein Language Models (PLMs) often suffer from limited adaptability to multiple tasks and exhibit poor generalization across diverse biological contexts. In contrast, general-purpose Large Language Models (LLMs) lack the capability to interpret protein sequences...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

Parallel Complex Diffusion for Scalable Time Series Generation

arXiv:2602.17706v1 Announce Type: new Abstract: Modeling long-range dependencies in time series generation poses a fundamental trade-off between representational capacity and computational efficiency. Traditional temporal diffusion models suffer from local entanglement and the $\mathcal{O}(L^2)$ cost of attention mechanisms. We address these...

1 min 1 month, 3 weeks ago

adr

LOW Academic International

Causality by Abstraction: Symbolic Rule Learning in Multivariate Timeseries with Large Language Models

arXiv:2602.17829v1 Announce Type: new Abstract: Inferring causal relations in timeseries data with delayed effects is a fundamental challenge, especially when the underlying system exhibits complex dynamics that cannot be captured by simple functional mappings. Traditional approaches often fail to produce...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

MePoly: Max Entropy Polynomial Policy Optimization

arXiv:2602.17832v1 Announce Type: new Abstract: Stochastic Optimal Control provides a unified mathematical framework for solving complex decision-making problems, encompassing paradigms such as maximum entropy reinforcement learning (RL) and imitation learning (IL). However, conventional parametric policies often struggle to represent the multi-modality of...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

Two Calm Ends and the Wild Middle: A Geometric Picture of Memorization in Diffusion Models

arXiv:2602.17846v1 Announce Type: new Abstract: Diffusion models generate high-quality samples but can also memorize training data, raising serious privacy concerns. Understanding the mechanisms governing when memorization versus generalization occurs remains an active area of research. In particular, it is unclear...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

Dual Length Codes for Lossless Compression of BFloat16

arXiv:2602.17849v1 Announce Type: new Abstract: Training and serving Large Language Models (LLMs) relies heavily on parallelization and collective operations, which are frequently bottlenecked by network bandwidth. Lossless compression using, e.g., Huffman codes can alleviate the issue; however, Huffman codes suffer...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

Distribution-Free Sequential Prediction with Abstentions

arXiv:2602.17918v1 Announce Type: new Abstract: We study a sequential prediction problem in which an adversary is allowed to inject arbitrarily many adversarial instances in a stream of i.i.d. instances, but at each round, the learner may also abstain from making...

1 min 1 month, 3 weeks ago

bit

LOW Academic United States

X-MAP: eXplainable Misclassification Analysis and Profiling for Spam and Phishing Detection

arXiv:2602.15298v1 Announce Type: new Abstract: Misclassifications in spam and phishing detection are very harmful, as false negatives expose users to attacks while false positives degrade trust. Existing uncertainty-based detectors can flag potential errors, but may be deceived and offer limited...

1 min 1 month, 3 weeks ago

bit

LOW Academic United States

Quantifying construct validity in large language model evaluations

arXiv:2602.15532v1 Announce Type: new Abstract: The LLM community often reports benchmark results as if they are synonymous with general model capabilities. However, benchmarks can have problems that distort performance, like test set contamination and annotator error. How can we know...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

LemonadeBench: Evaluating the Economic Intuition of Large Language Models in Simple Markets

arXiv:2602.13209v1 Announce Type: cross Abstract: We introduce LemonadeBench v0.5, a minimal benchmark for evaluating economic intuition, long-term planning, and decision-making under uncertainty in large language models (LLMs) through a simulated lemonade stand business. Models must manage inventory with expiring goods,...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

CircuChain: Disentangling Competence and Compliance in LLM Circuit Analysis

arXiv:2602.15037v1 Announce Type: cross Abstract: As large language models (LLMs) advance toward expert-level performance in engineering domains, reliable reasoning under user-specified constraints becomes critical. In circuit analysis, for example, a numerically correct solution is insufficient if it violates established methodological...

1 min 1 month, 3 weeks ago

bit

LOW Academic United States

Combining scEEG and PPG for reliable sleep staging using lightweight wearables

arXiv:2602.15042v1 Announce Type: cross Abstract: Reliable sleep staging remains challenging for lightweight wearable devices such as single-channel electroencephalography (scEEG) or photoplethysmography (PPG). scEEG offers direct measurement of cortical activity and serves as the foundation for sleep staging, yet exhibits limited...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

Safe-SDL: Establishing Safety Boundaries and Control Mechanisms for AI-Driven Self-Driving Laboratories

arXiv:2602.15061v1 Announce Type: cross Abstract: The emergence of Self-Driving Laboratories (SDLs) transforms scientific discovery methodology by integrating AI with robotic automation to create closed-loop experimental systems capable of autonomous hypothesis generation, experimentation, and analysis. While promising to compress research timelines...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

Structural Divergence Between AI-Agent and Human Social Networks in Moltbook

arXiv:2602.15064v1 Announce Type: cross Abstract: Large populations of AI agents are increasingly embedded in online environments, yet little is known about how their collective interaction patterns compare to human social systems. Here, we analyze the full interaction network of Moltbook,...

1 min 1 month, 3 weeks ago

bit

LOW Academic European Union

S-PRESSO: Ultra Low Bitrate Sound Effect Compression With Diffusion Autoencoders And Offline Quantization

arXiv:2602.15082v1 Announce Type: cross Abstract: Neural audio compression models have recently achieved extreme compression rates, enabling efficient latent generative modeling. Conversely, latent generative models have been applied to compression, pushing the limits of continuous and discrete approaches. However, existing methods...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

StrokeNeXt: A Siamese-encoder Approach for Brain Stroke Classification in Computed Tomography Imagery

arXiv:2602.15087v1 Announce Type: cross Abstract: We present StrokeNeXt, a model for stroke classification in 2D Computed Tomography (CT) images. StrokeNeXt employs a dual-branch design with two ConvNeXt encoders, whose features are fused through a lightweight convolutional decoder based on stacked...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

Far Out: Evaluating Language Models on Slang in Australian and Indian English

arXiv:2602.15373v1 Announce Type: new Abstract: Language models exhibit systematic performance gaps when processing text in non-standard language varieties, yet their ability to comprehend variety-specific slang remains underexplored for several languages. We present a comprehensive evaluation of slang awareness in Indian...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

In Agents We Trust, but Who Do Agents Trust? Latent Source Preferences Steer LLM Generations

arXiv:2602.15456v1 Announce Type: new Abstract: Agents based on Large Language Models (LLMs) are increasingly being deployed as interfaces to information on online platforms. These agents filter, prioritize, and synthesize information retrieved from the platforms' back-end databases or via web search....

1 min 1 month, 3 weeks ago

bit

LOW Academic European Union

ExpertWeaver: Unlocking the Inherent MoE in Dense LLMs with GLU Activation Patterns

arXiv:2602.15521v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) effectively scales model capacity while preserving computational efficiency through sparse expert activation. However, training high-quality MoEs from scratch is prohibitively expensive. A promising alternative is to convert pretrained dense models into sparse MoEs....

1 min 1 month, 3 weeks ago

bit

LOW Academic International

ZeroSyl: Simple Zero-Resource Syllable Tokenization for Spoken Language Modeling

arXiv:2602.15537v1 Announce Type: new Abstract: Pure speech language models aim to learn language directly from raw audio without textual resources. A key challenge is that discrete tokens from self-supervised speech encoders result in excessively long sequences, motivating recent work on...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

How Uncertain Is the Grade? A Benchmark of Uncertainty Metrics for LLM-Based Automatic Assessment

arXiv:2602.16039v1 Announce Type: new Abstract: The rapid rise of large language models (LLMs) is reshaping the landscape of automatic assessment in education. While these systems demonstrate substantial advantages in adaptability to diverse question types and flexibility in output formats, they...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

EdgeNav-QE: QLoRA Quantization and Dynamic Early Exit for LAM-based Navigation on Edge Devices

arXiv:2602.15836v1 Announce Type: cross Abstract: Large Action Models (LAMs) have shown immense potential in autonomous navigation by bridging high-level reasoning with low-level control. However, deploying these multi-billion parameter models on edge devices remains a significant challenge due to memory constraints...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

Do Personality Traits Interfere? Geometric Limitations of Steering in Large Language Models

arXiv:2602.15847v1 Announce Type: cross Abstract: Personality steering in large language models (LLMs) commonly relies on injecting trait-specific steering vectors, implicitly assuming that personality traits can be controlled independently. In this work, we examine whether this assumption holds by analysing the...

1 min 1 month, 3 weeks ago

bit

LOW Academic International

Building Safe and Deployable Clinical Natural Language Processing under Temporal Leakage Constraints

arXiv:2602.15852v1 Announce Type: cross Abstract: Clinical natural language processing (NLP) models have shown promise for supporting hospital discharge planning by leveraging narrative clinical documentation. However, note-based models are particularly vulnerable to temporal and lexical leakage, where documentation artifacts encode future...

1 min 1 month, 4 weeks ago

bit
