HBVLA: Pushing 1-Bit Post-Training Quantization for Vision-Language-Action Models
arXiv:2602.13710v1 Announce Type: new Abstract: Vision-Language-Action (VLA) models enable instruction-following embodied control, but their large compute and memory footprints hinder deployment on resource-constrained robots and edge platforms. While reducing weights to 1-bit precision through binarization can greatly improve efficiency, existing...
Discrete Double-Bracket Flows for Isotropic-Noise Invariant Eigendecomposition
arXiv:2602.13759v1 Announce Type: new Abstract: We study matrix-free eigendecomposition under a matrix-vector product (MVP) oracle, where each step observes a covariance operator $C_k = C_{sig} + \sigma_k^2 I + E_k$. Standard stochastic approximation methods either use fixed steps that couple...
On Representation Redundancy in Large-Scale Instruction Tuning Data Selection
arXiv:2602.13773v1 Announce Type: new Abstract: Data quality is a crucial factor in large language models training. While prior work has shown that models trained on smaller, high-quality datasets can outperform those trained on much larger but noisy or low-quality corpora,...
Cast-R1: Learning Tool-Augmented Sequential Decision Policies for Time Series Forecasting
arXiv:2602.13802v1 Announce Type: new Abstract: Time series forecasting has long been dominated by model-centric approaches that formulate prediction as a single-pass mapping from historical observations to future values. Despite recent progress, such formulations often struggle in complex and evolving settings,...
Testing For Distribution Shifts with Conditional Conformal Test Martingales
arXiv:2602.13848v1 Announce Type: new Abstract: We propose a sequential test for detecting arbitrary distribution shifts that allows conformal test martingales (CTMs) to work under a fixed, reference-conditional setting. Existing CTM detectors construct test martingales by continually growing a reference set...
Why Code, Why Now: Learnability, Computability, and the Real Limits of Machine Learning
arXiv:2602.13934v1 Announce Type: new Abstract: Code generation has progressed more reliably than reinforcement learning, largely because code has an information structure that makes it learnable. Code provides dense, local, verifiable feedback at every token, whereas most reinforcement learning problems do...
Apple is reportedly cooking up a trio of AI wearables
As the AI hardware space heats up, the iPhone maker has multiple smart products in development.
Adani pledges $100B to build AI data centers as India seeks bigger role in the global AI race
Adani's plan targets up to 5 gigawatts of capacity, with data centers planned alongside partnerships with Google, Microsoft, and Flipkart.
As AI jitters rattle IT stocks, Infosys partners with Anthropic to build ‘enterprise-grade’ AI agents
Under the partnership, Infosys plans to integrate Anthropic's Claude models into its Topaz AI platform to build so-called "agentic" systems.