Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO
arXiv:2603.21016v1 Announce Type: new Abstract: Large language models (LLMs) used for multiple-choice and pairwise evaluation tasks often exhibit selection bias due to non-semantic factors like option positions and label symbols. Existing inference-time debiasing is costly and may harm reasoning, while...
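The "costly inference-time debiasing" the abstract contrasts itself with can be illustrated with a small sketch: score every ordering of the options and average per option, which cancels positional bias at factorial cost. This is a generic baseline for intuition, not the paper's permutation-aware GRPO method; `score_fn` and the toy scorer are hypothetical.

```python
import itertools

def debias_by_permutation(score_fn, question, options):
    """Average each option's score over all orderings so that any
    position-dependent bias is spread uniformly across options.
    Cost grows factorially with the number of options, which is
    why training-time debiasing is attractive."""
    totals = {opt: 0.0 for opt in options}
    perms = list(itertools.permutations(options))
    for perm in perms:
        scores = score_fn(question, list(perm))  # one score per position
        for opt, s in zip(perm, scores):
            totals[opt] += s
    return {opt: t / len(perms) for opt, t in totals.items()}

# Toy biased scorer: true preference plus a +0.6 bonus for position 0.
truth = {"A": 0.2, "B": 0.9, "C": 0.5}
def biased_score(question, ordered):
    return [truth[o] + (0.6 if i == 0 else 0.0) for i, o in enumerate(ordered)]

avg = debias_by_permutation(biased_score, "q", ["A", "B", "C"])
```

After averaging, every option has absorbed the position-0 bonus equally (each appears first in 2 of the 6 orderings), so the ranking reflects only the underlying preference.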
Collaborative Adaptive Curriculum for Progressive Knowledge Distillation
arXiv:2603.20296v1 Announce Type: new Abstract: Recent advances in collaborative knowledge distillation have demonstrated cutting-edge performance for resource-constrained distributed multimedia learning scenarios. However, achieving such competitiveness requires addressing a fundamental mismatch: high-dimensional teacher knowledge complexity versus heterogeneous client learning capacities, which...
Probing the Latent World: Emergent Discrete Symbols and Physical Structure in Latent Representations
arXiv:2603.20327v1 Announce Type: new Abstract: Video world models trained with Joint Embedding Predictive Architectures (JEPA) acquire rich spatiotemporal representations by predicting masked regions in latent space rather than reconstructing pixels. This removes the visual verification pathway of generative models, creating...
The Multiverse of Time Series Machine Learning: an Archive for Multivariate Time Series Classification
arXiv:2603.20352v1 Announce Type: new Abstract: Time series machine learning (TSML) is a growing research field that spans a wide range of tasks. The popularity of established tasks such as classification, clustering, and extrinsic regression has, in part, been driven by...
Thinking in Different Spaces: Domain-Specific Latent Geometry Survives Cross-Architecture Translation
arXiv:2603.20406v1 Announce Type: new Abstract: We investigate whether independently trained language models converge to geometrically compatible latent representations, and whether this compatibility can be exploited to correct model behavior at inference time without any weight updates. We learn a linear...
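The "linear..." the excerpt cuts off at presumably refers to a learned linear map between latent spaces. A minimal sketch of that idea, under the assumption that one model's activations are (up to noise) a linear transform of another's, is to fit the translator by least squares on paired activations; the synthetic data below stands in for real model activations.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-ins for two independently trained models' latent spaces:
# model B's representations are (unknown to us) a rotation of model A's.
d = 8
Q, _ = np.linalg.qr(rng.normal(size=(d, d)))       # hidden ground-truth rotation
H_a = rng.normal(size=(200, d))                    # model A activations on paired inputs
H_b = H_a @ Q + 0.01 * rng.normal(size=(200, d))   # model B activations, slight noise

# Learn a linear translator A -> B by ordinary least squares.
W, *_ = np.linalg.lstsq(H_a, H_b, rcond=None)

# Held-out check: the fitted map transfers to unseen activations.
H_test = rng.normal(size=(50, d))
err = np.linalg.norm(H_test @ W - H_test @ Q) / np.linalg.norm(H_test @ Q)
```

If such a map generalizes across held-out inputs, the two latent geometries are compatible in the sense the abstract investigates.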
Reinforcement Learning from Multi-Source Imperfect Preferences: Best-of-Both-Regimes Regret
arXiv:2603.20453v1 Announce Type: new Abstract: Reinforcement learning from human feedback (RLHF) replaces hard-to-specify rewards with pairwise trajectory preferences, yet regret-oriented theory often assumes that preference labels are generated consistently from a single ground-truth objective. In practical RLHF systems, however, feedback...
LJ-Bench: Ontology-Based Benchmark for U.S. Crime
arXiv:2603.20572v1 Announce Type: new Abstract: The potential of Large Language Models (LLMs) to provide harmful information remains a significant concern due to the vast breadth of illegal queries they may encounter. Unfortunately, existing benchmarks focus on only a handful of types...
Bayesian Learning in Episodic Zero-Sum Games
arXiv:2603.20604v1 Announce Type: new Abstract: We study Bayesian learning in episodic, finite-horizon zero-sum Markov games with unknown transition and reward models. We investigate a posterior algorithm in which each player maintains a Bayesian posterior over the game model, independently samples...
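The posterior-sampling loop the abstract describes can be sketched in a drastically simplified single-state setting: a 2x2 zero-sum game with Bernoulli payoffs, where the row player keeps a Beta posterior per entry, samples a game model each episode, and plays the sample's pure-strategy maximin action. This is a bandit-style caricature of the episodic Markov-game algorithm, not the paper's method; the payoff matrix is invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# True (unknown) mean payoffs for the row player in a 2x2 zero-sum game.
G_true = np.array([[0.8, 0.2],
                   [0.4, 0.6]])

# Independent Beta(1,1) posterior per entry over Bernoulli payoffs.
alpha = np.ones((2, 2))
beta = np.ones((2, 2))
counts = np.zeros((2, 2))
for _ in range(2000):
    G_sample = rng.beta(alpha, beta)           # sample a game model
    a = int(np.argmax(G_sample.min(axis=1)))   # row player's maximin action
    b = int(np.argmin(G_true[a]))              # adversarial best response
    r = rng.binomial(1, G_true[a, b])          # observed Bernoulli payoff
    alpha[a, b] += r                           # posterior update
    beta[a, b] += 1 - r
    counts[a, b] += 1

post_mean = alpha / (alpha + beta)
```

Against a best-responding opponent the maximin row (row 1, security value 0.4) is played most often, and the posterior concentrates on the payoff of the induced equilibrium cell.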
CFNN: Continued Fraction Neural Network
arXiv:2603.20634v1 Announce Type: new Abstract: Accurately characterizing non-linear functional manifolds with singularities is a fundamental challenge in scientific computing. While Multi-Layer Perceptrons (MLPs) dominate, their spectral bias hinders resolving high-curvature features without excessive parameters. We introduce Continued Fraction Neural Networks...
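The excerpt does not give the paper's parameterization, but a generic truncated continued fraction with affine coefficient functions conveys why the architecture suits singularities: the rational structure can express poles that an MLP must approximate with many parameters. The function below is a hedged sketch of that idea, not the paper's CFNN.

```python
def continued_fraction_net(x, a, b, eps=1e-6):
    """Evaluate a depth-K truncated continued fraction
        f(x) = a[0](x) + b[0](x) / (a[1](x) + b[1](x) / (...))
    where each a[k], b[k] is an affine function given as (slope, intercept).
    Evaluated bottom-up from the innermost term; eps guards division by zero.
    """
    K = len(a)
    frac = a[K - 1][0] * x + a[K - 1][1]          # innermost term
    for k in range(K - 2, -1, -1):
        num = b[k][0] * x + b[k][1]
        frac = a[k][0] * x + a[k][1] + num / (frac + eps)
    return frac

# Depth 2 already represents rational targets exactly, e.g. f(x) = x + 1/x
# via a = [(1, 0), (1, 0)], b = [(0, 1)], or a pole f(x) = 1/x via
# a = [(0, 0), (1, 0)], b = [(0, 1)].
val = continued_fraction_net(2.0, [(1, 0), (1, 0)], [(0, 1)])
```

In a trainable version the slopes and intercepts would be learned parameters (or small networks); here they are fixed to show the representational form.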
Evaluating Uplift Modeling under Structural Biases: Insights into Metric Stability and Model Robustness
arXiv:2603.20775v1 Announce Type: new Abstract: In personalized marketing, uplift models estimate incremental effects by modeling how customer behavior changes under alternative treatments. However, real-world data often exhibit biases - such as selection bias, spillover effects, and unobserved confounding - which...
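For readers unfamiliar with uplift modeling, the estimand the abstract evaluates can be shown with a minimal T-learner on synthetic randomized data: the uplift in a segment is the treated-minus-control outcome rate. This illustrates the kind of model under study, not the paper's own estimator or its bias scenarios; the data-generating process is invented.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic randomized trial: treatment lifts conversion by 0.2 only for x > 0.
n = 4000
x = rng.normal(size=n)
t = rng.binomial(1, 0.5, size=n)             # random treatment assignment
uplift_true = 0.2 * (x > 0)
y = rng.binomial(1, 0.3 + t * uplift_true)   # observed conversion

# Segment-wise T-learner: uplift = treated rate - control rate per segment.
seg = (x > 0).astype(int)
uplift_hat = np.array([
    y[(seg == s) & (t == 1)].mean() - y[(seg == s) & (t == 0)].mean()
    for s in (0, 1)
])
```

Under randomization this difference identifies the incremental effect; the biases the paper studies (selection, spillover, confounding) are precisely what break this identification in observational data.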
MAPLE: Metadata Augmented Private Language Evolution
arXiv:2603.19258v1 Announce Type: cross Abstract: While differentially private (DP) fine-tuning of large language models (LLMs) is a powerful tool, it is often computationally prohibitive or infeasible when state-of-the-art models are only accessible via proprietary APIs. In such settings, generating DP...
When both Grounding and not Grounding are Bad -- A Partially Grounded Encoding of Planning into SAT (Extended Version)
arXiv:2603.19429v1 Announce Type: new Abstract: Classical planning problems are typically defined using lifted first-order representations, which offer compactness and generality. While most planners ground these representations to simplify reasoning, this can cause an exponential blowup in size. Recent approaches instead...
From Comprehension to Reasoning: A Hierarchical Benchmark for Automated Financial Research Reporting
arXiv:2603.19254v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used to generate financial research reports, shifting from auxiliary analytic tools to primary content producers. Yet recent real-world deployments reveal persistent failures--factual errors, numerical inconsistencies, fabricated references, and shallow...
Significance-Gain Pair Encoding for LLMs: A Statistical Alternative to Frequency-Based Subword Merging
arXiv:2603.19261v1 Announce Type: new Abstract: Subword tokenization is a key design choice for modern language models, including large language models (LLMs), with byte- and character-level BPE serving as a widely used baseline. Standard BPE selects merges by raw pair frequency,...
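The contrast with frequency-based merging can be made concrete with a generic significance score (here, count-weighted pointwise mutual information) in place of raw pair counts. This is a stand-in statistic in the spirit of the abstract, not the paper's "significance-gain" measure.

```python
import math
from collections import Counter

def pair_scores(tokens):
    """Score adjacent pairs by count-weighted PMI instead of raw frequency:
    score(a, b) = count(a,b) * log(count(a,b) * n / (count(a) * count(b)))."""
    n = len(tokens)
    uni = Counter(tokens)
    pairs = Counter(zip(tokens, tokens[1:]))
    scores = {
        (a, b): c * math.log(c * n / (uni[a] * uni[b]))
        for (a, b), c in pairs.items()
    }
    return pairs, scores

# 'a' is globally frequent, so 'aa' has the highest raw count; but 'q' and
# 'z' co-occur essentially always, which the statistical score rewards.
tokens = list("aaaaab" * 5 + "qz" * 4)
pairs, scores = pair_scores(tokens)
freq_best = max(pairs, key=pairs.get)    # what frequency-based BPE would merge
sig_best = max(scores, key=scores.get)   # what the significance score prefers
```

The divergence between the two choices on even this tiny corpus is the motivation for moving beyond raw pair frequency.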
LLM-MRD: LLM-Guided Multi-View Reasoning Distillation for Fake News Detection
arXiv:2603.19293v1 Announce Type: new Abstract: Multimodal fake news detection is crucial for mitigating societal disinformation. Existing approaches attempt to address this by fusing multimodal features or leveraging Large Language Models (LLMs) for advanced reasoning. However, these methods suffer from serious...
BrainSCL: Subtype-Guided Contrastive Learning for Brain Disorder Diagnosis
arXiv:2603.19295v1 Announce Type: new Abstract: Mental disorder populations exhibit pronounced heterogeneity -- that is, significant differences between samples -- which poses a significant challenge to the definition of positive pairs in contrastive learning. To address this, we propose a subtype-guided...
The Residual Stream Is All You Need: On the Redundancy of the KV Cache in Transformer Inference
arXiv:2603.19664v1 Announce Type: new Abstract: The key-value (KV) cache is widely treated as essential state in transformer inference, and a large body of work engineers policies to compress, evict, or approximate its entries. We prove that this state is entirely...
Elon Musk unveils chip manufacturing plans for SpaceX and Tesla
Elon Musk recently outlined ambitious plans for a chip-building collaboration between Tesla and SpaceX — but he has a history of overpromising.
Unanimous court allows street preacher’s free speech case to move forward
A unanimous court on Friday sided with a Mississippi street preacher who sued to block future enforcement of a public demonstration ordinance that he was previously convicted of violating. […]
An Onto-Relational-Sophic Framework for Governing Synthetic Minds
arXiv:2603.18633v1 Announce Type: new Abstract: The rapid evolution of artificial intelligence, from task-specific systems to foundation models exhibiting broad, flexible competence across reasoning, creative synthesis, and social interaction, has outpaced the conceptual and governance frameworks designed to manage it. Current...
BenchBrowser -- Collecting Evidence for Evaluating Benchmark Validity
arXiv:2603.18019v1 Announce Type: new Abstract: Do language model benchmarks actually measure what practitioners intend them to? High-level metadata is too coarse to convey the granular reality of benchmarks: a "poetry" benchmark may never test for haikus, while "instruction-following" benchmarks...
Multi-Trait Subspace Steering to Reveal the Dark Side of Human-AI Interaction
arXiv:2603.18085v1 Announce Type: new Abstract: Recent incidents have highlighted alarming cases where human-AI interactions led to negative psychological outcomes, including mental health crises and even user harm. As LLMs serve as sources of guidance, emotional support, and even informal therapy,...
MANAR: Memory-augmented Attention with Navigational Abstract Conceptual Representation
arXiv:2603.18676v1 Announce Type: new Abstract: MANAR (Memory-augmented Attention with Navigational Abstract Conceptual Representation) is a contextualization layer that generalizes standard multi-head attention (MHA) by instantiating the principles of Global Workspace Theory (GWT). While MHA enables unconstrained all-to-all communication, it lacks the functional bottleneck...
AlignMamba-2: Enhancing Multimodal Fusion and Sentiment Analysis with Modality-Aware Mamba
arXiv:2603.18462v1 Announce Type: new Abstract: In the era of large-scale pre-trained models, effectively adapting general knowledge to specific affective computing tasks remains a challenge, particularly regarding computational efficiency and multimodal heterogeneity. While Transformer-based methods have excelled at modeling inter-modal dependencies,...
Reasonably reasoning AI agents can avoid game-theoretic failures in zero-shot, provably
arXiv:2603.18563v1 Announce Type: new Abstract: AI agents are increasingly deployed in interactive economic environments characterized by repeated AI-AI interactions. Despite AI agents' advanced capabilities, empirical studies reveal that such interactions often fail to stably induce a strategic equilibrium, such as...
AS2 -- Attention-Based Soft Answer Sets: An End-to-End Differentiable Neuro-Soft-Symbolic Reasoning Architecture
arXiv:2603.18436v1 Announce Type: new Abstract: Neuro-symbolic artificial intelligence (AI) systems typically couple a neural perception module to a discrete symbolic solver through a non-differentiable boundary, preventing constraint-satisfaction feedback from reaching the perception encoder during training. We introduce AS2 (Attention-Based Soft...
Analysis Of Linguistic Stereotypes in Single and Multi-Agent Generative AI Architectures
arXiv:2603.18729v1 Announce Type: new Abstract: Many works in the literature show that LLM outputs exhibit discriminatory behaviour, triggering stereotype-based inferences based on the dialect in which the inputs are written. This bias has been shown to be particularly pronounced when...
Do Large Language Models Possess a Theory of Mind? A Comparative Evaluation Using the Strange Stories Paradigm
arXiv:2603.18007v1 Announce Type: new Abstract: The study explores whether current Large Language Models (LLMs) exhibit Theory of Mind (ToM) capabilities -- specifically, the ability to infer others' beliefs, intentions, and emotions from text. Given that LLMs are trained on language...
dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models
arXiv:2603.18806v1 Announce Type: new Abstract: Diffusion Large Language Models (dLLMs) introduce a new paradigm for language generation, which in turn presents new challenges for aligning them with human preferences. In this work, we aim to improve the policy optimization for...
Accurate and Efficient Multi-Channel Time Series Forecasting via Sparse Attention Mechanism
arXiv:2603.18712v1 Announce Type: new Abstract: The task of multi-channel time series forecasting is ubiquitous in numerous fields such as finance, supply chain management, and energy planning. It is critical to effectively capture complex dynamic dependencies within and between channels for...
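The excerpt names the mechanism but not its details; a generic top-k sparse attention over channels conveys the idea: each query attends only to its k highest-scoring keys, suppressing weakly related channels and reducing effective cost. A hedged sketch, not the paper's specific mechanism:

```python
import numpy as np

def topk_sparse_attention(Q, K, V, k):
    """For each query row, keep only the k highest attention scores and
    mask the rest to -inf before the softmax, so every output mixes at
    most k value vectors."""
    scores = Q @ K.T / np.sqrt(Q.shape[-1])
    kth = np.sort(scores, axis=-1)[:, -k][:, None]      # k-th largest per row
    masked = np.where(scores >= kth, scores, -np.inf)
    w = np.exp(masked - masked.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    return w @ V, (w > 0).sum(axis=-1)                  # output, nonzeros/row

rng = np.random.default_rng(0)
C, d = 12, 4                        # 12 channels, 4-dim channel embeddings
Q, K, V = (rng.normal(size=(C, d)) for _ in range(3))
out, nnz = topk_sparse_attention(Q, K, V, k=3)
```

With continuous scores each row ends up with exactly k nonzero weights, so the cross-channel mixing is explicitly sparse rather than dense.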