Understanding Artificial Theory of Mind: Perturbed Tasks and Reasoning in Large Language Models
arXiv:2602.22072v1 Announce Type: new Abstract: Theory of Mind (ToM) refers to an agent's ability to model the internal states of others. Contributing to the debate over whether large language models (LLMs) exhibit genuine ToM capabilities, our study investigates their ToM robustness...
Training Generalizable Collaborative Agents via Strategic Risk Aversion
arXiv:2602.21515v1 Announce Type: new Abstract: Many emerging agentic paradigms require agents to collaborate with one another (or people) to achieve shared goals. Unfortunately, existing approaches to learning policies for such collaborative problems produce brittle solutions that fail when paired with...
Sophia Space raises $10M seed to demo novel space computers
The company's modular computer tiles offer a new vision for space data centers.
Exhibit in Boston’s startup ecosystem at TechCrunch Founder Summit 2026
On June 9, over 1,000 founders, investors, and decision-makers will gather for TechCrunch Founder Summit 2026. This isn’t just foot traffic. It’s a full day of concentrated deal flow.
Prompt-Level Distillation: A Non-Parametric Alternative to Model Fine-Tuning for Efficient Reasoning
arXiv:2602.21103v1 Announce Type: new Abstract: Advanced reasoning typically requires Chain-of-Thought prompting, which is accurate but incurs prohibitive latency and substantial test-time inference costs. The standard alternative, fine-tuning smaller models, often sacrifices interpretability while introducing significant resource and operational overhead. To...
MoBiQuant: Mixture-of-Bits Quantization for Token-Adaptive Elastic LLMs
arXiv:2602.20191v1 Announce Type: cross Abstract: Varying runtime constraints on cloud and edge devices necessitate elastic large language model (LLM) deployment, where an LLM can run at different quantization precisions depending on available computational resources. However, it has been observed...
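As background for the precision/accuracy trade-off behind elastic quantized inference, here is a minimal sketch of symmetric uniform quantization at different bit-widths. The weights and helper names are illustrative; this is not MoBiQuant's actual scheme.

```python
def quantize_dequantize(values, bits):
    """Round values onto a signed integer grid with 2**bits levels, then map back."""
    qmax = 2 ** (bits - 1) - 1                       # e.g. 127 for 8-bit, 1 for 2-bit
    scale = max(abs(v) for v in values) / qmax or 1.0
    return [round(v / scale) * scale for v in values]

weights = [0.31, -0.87, 0.05, 0.62]
for bits in (8, 4, 2):
    approx = quantize_dequantize(weights, bits)
    err = max(abs(a - w) for a, w in zip(approx, weights))
    print(bits, round(err, 4))   # reconstruction error grows as bits shrink
```

Running at fewer bits shrinks memory and compute but widens the rounding error, which is the tension an elastic deployment has to manage per token or per request.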
Learning to Solve Complex Problems via Dataset Decomposition
arXiv:2602.20296v1 Announce Type: new Abstract: Curriculum learning is a class of training strategies that organizes the data a model is exposed to by difficulty, progressing gradually from simpler to more complex examples. This research explores a reverse curriculum generation approach that...
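The core mechanic of curriculum ordering can be sketched in a few lines: sort examples by a difficulty score and present them simplest-first, or hardest-first for a reverse curriculum as the abstract explores. The examples and scores below are illustrative placeholders.

```python
# Toy (text, difficulty) pairs; a real pipeline would score difficulty automatically.
examples = [("long multi-step problem", 0.9),
            ("medium problem", 0.5),
            ("one-step problem", 0.1)]

def curriculum(batch, reverse=False):
    """Order examples by difficulty score: ascending (classic) or descending (reverse)."""
    return [text for text, score in sorted(batch, key=lambda e: e[1], reverse=reverse)]

print(curriculum(examples))                # simplest first
print(curriculum(examples, reverse=True))  # reverse curriculum: hardest first
```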
QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models
arXiv:2602.20309v1 Announce Type: new Abstract: Vision-language-action (VLA) models unify perception, language, and control for embodied agents but face significant challenges in practical deployment due to rapidly increasing compute and memory demands, especially as models scale to longer horizons and larger...
CaDrift: A Time-dependent Causal Generator of Drifting Data Streams
arXiv:2602.20329v1 Announce Type: new Abstract: This work presents Causal Drift Generator (CaDrift), a time-dependent synthetic data generator framework based on Structural Causal Models (SCMs). The framework produces a virtually unlimited variety of data streams with controlled shift events and time-dependent...
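To make the SCM-with-drift idea concrete, here is a hedged toy stream in which X causes Y and the causal mechanism flips at a chosen time step, producing a controlled concept-drift event. The variables and mechanism are illustrative, not CaDrift's actual design.

```python
import random

def scm_stream(n, t_drift, seed=0):
    """Yield (t, x, y) where y = f_t(x) + noise and f_t changes at t_drift."""
    rng = random.Random(seed)
    for t in range(n):
        x = rng.gauss(0, 1)                       # exogenous cause
        slope = 2.0 if t < t_drift else -2.0      # mechanism shift = drift event
        y = slope * x + rng.gauss(0, 0.1)         # effect with small noise
        yield t, x, y

stream = list(scm_stream(200, t_drift=100))
```

Before the drift point the X-Y relationship is strongly positive; after it, strongly negative, giving a stream whose shift a detector should flag.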
How Do LLMs Encode Scientific Quality? An Empirical Study Using Monosemantic Features from Sparse Autoencoders
arXiv:2602.19115v1 Announce Type: new Abstract: In recent years, there has been a growing use of generative AI, and large language models (LLMs) in particular, to support both the assessment and generation of scientific work. Although some studies have shown that...
Retrieval Augmented Enhanced Dual Co-Attention Framework for Target Aware Multimodal Bengali Hateful Meme Detection
arXiv:2602.19212v1 Announce Type: new Abstract: Hateful content on social media increasingly appears as multimodal memes that combine images and text to convey harmful narratives. In low-resource languages such as Bengali, automated detection remains challenging due to limited annotated data, class...
Pyramid MoA: A Probabilistic Framework for Cost-Optimized Anytime Inference
arXiv:2602.19509v1 Announce Type: new Abstract: Large Language Models (LLMs) face a persistent trade-off between inference cost and reasoning capability. While "Oracle" models (e.g., Llama-3-70B) achieve state-of-the-art accuracy, they are prohibitively expensive for high-volume deployment. Smaller models (e.g., 8B parameters) are...
Sculpting the Vector Space: Towards Efficient Multi-Vector Visual Document Retrieval via Prune-then-Merge Framework
arXiv:2602.19549v1 Announce Type: new Abstract: Visual Document Retrieval (VDR), which aims to retrieve relevant pages within vast corpora of visually-rich documents, is of significance in current multimodal retrieval applications. The state-of-the-art multi-vector paradigm excels in performance but suffers from prohibitive...
AdaptStress: Online Adaptive Learning for Interpretable and Personalized Stress Prediction Using Multivariate and Sparse Physiological Signals
arXiv:2602.18521v1 Announce Type: new Abstract: Continuous stress forecasting could contribute to lifestyle interventions. This paper presents a novel, explainable, and individualized approach for stress prediction using physiological data from consumer-grade smartwatches. We develop a time series forecasting model that...
The Geometry of Multi-Task Grokking: Transverse Instability, Superposition, and Weight Decay Phase Structure
arXiv:2602.18523v1 Announce Type: new Abstract: Grokking -- the abrupt transition from memorization to generalization long after near-zero training loss -- has been studied mainly in single-task settings. We extend geometric analysis to multi-task modular arithmetic, training shared-trunk Transformers on dual-task...
GIST: Targeted Data Selection for Instruction Tuning via Coupled Optimization Geometry
arXiv:2602.18584v1 Announce Type: new Abstract: Targeted data selection has emerged as a crucial paradigm for efficient instruction tuning, aiming to identify a small yet influential subset of training examples for a specific target task. In practice, influence is often measured...
Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation
arXiv:2602.18795v1 Announce Type: new Abstract: Latent Dirichlet Allocation (LDA) is a foundational model for discovering latent thematic structure in discrete data, but its Dirichlet prior cannot represent the rich correlations and hierarchical relationships often present among topics. We introduce the...
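As background on the prior the abstract builds on: LDA draws per-document topic proportions from a Dirichlet, which can be sampled by normalizing independent Gamma draws. Because the components must sum to one, a single Dirichlet forces negative correlation between topics, which is the limitation that motivates richer tree-structured priors. The concentration parameters below are illustrative.

```python
import random

def sample_dirichlet(alphas, rng):
    """Draw topic proportions from Dirichlet(alphas) via normalized Gamma samples."""
    draws = [rng.gammavariate(a, 1.0) for a in alphas]
    total = sum(draws)
    return [d / total for d in draws]

rng = random.Random(0)
theta = sample_dirichlet([0.5, 0.5, 0.5], rng)
print(theta)  # three topic proportions summing to 1
```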
VariBASed: Variational Bayes-Adaptive Sequential Monte-Carlo Planning for Deep Reinforcement Learning
arXiv:2602.18857v1 Announce Type: new Abstract: Optimally trading off exploration and exploitation is the holy grail of reinforcement learning, as it promises maximal data efficiency for solving any task. Bayes-optimal agents achieve this, but obtaining the belief-state and performing planning are both typically...
PCA-VAE: Differentiable Subspace Quantization without Codebook Collapse
arXiv:2602.18904v1 Announce Type: new Abstract: Vector-quantized autoencoders deliver high-fidelity latents but suffer inherent flaws: the quantizer is non-differentiable, requires straight-through hacks, and is prone to collapse. We address these issues at the root by replacing VQ with a simple, principled,...
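The "straight-through hack" the abstract refers to can be sketched without a deep learning framework: the forward pass snaps a latent to its nearest codebook entry (non-differentiable), while the backward pass copies the gradient through as if quantization were the identity. The codebook and values here are illustrative.

```python
codebook = [-1.0, 0.0, 1.0]

def vq_forward(z):
    """Forward pass: snap z to the nearest code (a non-differentiable step)."""
    return min(codebook, key=lambda c: abs(c - z))

def vq_backward(grad_out):
    """Straight-through estimator: pretend quantization was identity, pass gradient unchanged."""
    return grad_out

print(vq_forward(0.4))     # snaps to the nearest code, 0.0
print(vq_backward(0.25))   # gradient passes straight through
```

Since the gradient ignores the snapping step, codes that are never selected receive no learning signal, which is one route to the codebook collapse the paper targets.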
Financial time series augmentation using transformer based GAN architecture
arXiv:2602.17865v1 Announce Type: cross Abstract: Time-series forecasting is a critical task across many domains, from engineering to economics, where accurate predictions drive strategic decisions. However, applying advanced deep learning models in challenging, volatile domains like finance is difficult due to...
Decomposing Retrieval Failures in RAG for Long-Document Financial Question Answering
arXiv:2602.17981v1 Announce Type: new Abstract: Retrieval-augmented generation is increasingly used for financial question answering over long regulatory filings, yet reliability depends on retrieving the exact context needed to justify answers in high stakes settings. We study a frequent failure mode...
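The retrieval step whose failures the paper decomposes can be sketched generically: rank document chunks by cosine similarity to a query embedding and keep the top-k. The chunk names and toy vectors below are illustrative; a real system would use a learned encoder over filing text.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

chunks = {"revenue note": [0.9, 0.1, 0.0],
          "risk factors": [0.1, 0.9, 0.1],
          "boilerplate":  [0.0, 0.1, 0.9]}

def retrieve(query_vec, k=1):
    """Return the k chunk names most similar to the query embedding."""
    ranked = sorted(chunks, key=lambda c: cosine(query_vec, chunks[c]), reverse=True)
    return ranked[:k]

print(retrieve([1.0, 0.0, 0.0]))  # chunk closest to the query
```

Retrieval fails when the chunk needed to justify an answer ranks below k, which in long filings happens easily when near-duplicate boilerplate crowds the top of the list.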
VIRAASAT: Traversing Novel Paths for Indian Cultural Reasoning
arXiv:2602.18429v1 Announce Type: new Abstract: Large Language Models (LLMs) have made significant progress in reasoning tasks across various domains such as mathematics and coding. However, their performance deteriorates in tasks requiring rich socio-cultural knowledge and diverse local contexts, particularly those...
LATMiX: Learnable Affine Transformations for Microscaling Quantization of LLMs
arXiv:2602.17681v1 Announce Type: cross Abstract: Post-training quantization (PTQ) is a widely used approach for reducing the memory and compute costs of large language models (LLMs). Recent studies have shown that applying invertible transformations to activations can significantly improve quantization robustness...
Tethered Reasoning: Decoupling Entropy from Hallucination in Quantized LLMs via Manifold Steering
arXiv:2602.17691v1 Announce Type: cross Abstract: Quantized language models face a fundamental dilemma: low sampling temperatures yield repetitive, mode-collapsed outputs, while high temperatures (T > 2.0) cause trajectory divergence and semantic incoherence. We present HELIX, a geometric framework that decouples output...
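The temperature dilemma the abstract describes can be seen directly in temperature-scaled softmax: the same logits become near-greedy at low T (mode-collapse risk) and near-uniform at high T (incoherence risk). The logits below are illustrative; this sketches the standard mechanism, not HELIX itself.

```python
import math

def softmax_with_temperature(logits, T):
    """Convert logits to a sampling distribution sharpened (low T) or flattened (high T)."""
    scaled = [l / T for l in logits]
    m = max(scaled)                         # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    z = sum(exps)
    return [e / z for e in exps]

logits = [2.0, 1.0, 0.5]
print(softmax_with_temperature(logits, 0.2))  # sharply peaked on the top token
print(softmax_with_temperature(logits, 3.0))  # close to uniform
```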
Analyzing and Improving Chain-of-Thought Monitorability Through Information Theory
arXiv:2602.18297v1 Announce Type: cross Abstract: Chain-of-thought (CoT) monitors are LLM-based systems that analyze reasoning traces to detect when outputs may exhibit attributes of interest, such as test-hacking behavior during code generation. In this paper, we use information-theoretic analysis to show...
BioBridge: Bridging Proteins and Language for Enhanced Biological Reasoning with LLMs
arXiv:2602.17680v1 Announce Type: new Abstract: Existing Protein Language Models (PLMs) often suffer from limited adaptability to multiple tasks and exhibit poor generalization across diverse biological contexts. In contrast, general-purpose Large Language Models (LLMs) lack the capability to interpret protein sequences...
Parallel Complex Diffusion for Scalable Time Series Generation
arXiv:2602.17706v1 Announce Type: new Abstract: Modeling long-range dependencies in time series generation poses a fundamental trade-off between representational capacity and computational efficiency. Traditional temporal diffusion models suffer from local entanglement and the $\mathcal{O}(L^2)$ cost of attention mechanisms. We address these...
Causality by Abstraction: Symbolic Rule Learning in Multivariate Timeseries with Large Language Models
arXiv:2602.17829v1 Announce Type: new Abstract: Inferring causal relations in timeseries data with delayed effects is a fundamental challenge, especially when the underlying system exhibits complex dynamics that cannot be captured by simple functional mappings. Traditional approaches often fail to produce...
MePoly: Max Entropy Polynomial Policy Optimization
arXiv:2602.17832v1 Announce Type: new Abstract: Stochastic Optimal Control provides a unified mathematical framework for solving complex decision-making problems, encompassing paradigms such as maximum entropy reinforcement learning (RL) and imitation learning (IL). However, conventional parametric policies often struggle to represent the multi-modality of...
Two Calm Ends and the Wild Middle: A Geometric Picture of Memorization in Diffusion Models
arXiv:2602.17846v1 Announce Type: new Abstract: Diffusion models generate high-quality samples but can also memorize training data, raising serious privacy concerns. Understanding the mechanisms governing when memorization versus generalization occurs remains an active area of research. In particular, it is unclear...