All Articles
Articles
Seeing to Generalize: How Visual Data Corrects Binding Shortcuts
arXiv:2602.15183v1 Announce Type: cross Abstract: Vision Language Models (VLMs) are designed to extend Large Language Models (LLMs) with visual capabilities, yet in this work we …
ScrapeGraphAI-100k: A Large-Scale Dataset for LLM-Based Web Information Extraction
arXiv:2602.15189v1 Announce Type: cross Abstract: The use of large language models for web information extraction is becoming increasingly fundamental to modern web information retrieval pipelines. …
Weight space Detection of Backdoors in LoRA Adapters
arXiv:2602.15195v1 Announce Type: cross Abstract: LoRA adapters let users fine-tune large language models (LLMs) efficiently. However, LoRA adapters are shared through open repositories like Hugging …
Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems
arXiv:2602.15198v1 Announce Type: cross Abstract: Multi-agent systems, where LLM agents communicate through free-form language, enable sophisticated coordination for solving complex cooperative tasks. This surfaces a …
How to Train Your Long-Context Visual Document Model
arXiv:2602.15257v1 Announce Type: cross Abstract: We present the first comprehensive, large-scale study of training long-context vision language models up to 344K context, targeting long-document visual …
FrameRef: A Framing Dataset and Simulation Testbed for Modeling Bounded Rational Information Health
arXiv:2602.15273v1 Announce Type: cross Abstract: Information ecosystems increasingly shape how people internalize exposure to adverse digital experiences, raising concerns about the long-term consequences for information …
The Information Geometry of Softmax: Probing and Steering
arXiv:2602.15293v1 Announce Type: cross Abstract: This paper concerns the question of how AI systems encode semantic structure into the geometric structure of their representation spaces. …
Prescriptive Scaling Reveals the Evolution of Language Model Capabilities
arXiv:2602.15327v1 Announce Type: cross Abstract: For deploying foundation models, practitioners increasingly need prescriptive scaling laws: given a pre training compute budget, what downstream accuracy is …
Discovering Implicit Large Language Model Alignment Objectives
arXiv:2602.15338v1 Announce Type: cross Abstract: Large language model (LLM) alignment relies on complex reward signals that often obscure the specific behaviors being incentivized, creating critical …
Proactive Conversational Assistant for a Procedural Manual Task based on Audio and IMU
arXiv:2602.15707v1 Announce Type: cross Abstract: Real-time conversational assistants for procedural tasks often depend on video input, which can be computationally expensive and compromise user privacy. …
Near-Optimal Sample Complexity for Online Constrained MDPs
arXiv:2602.15076v1 Announce Type: new Abstract: Safety is a fundamental challenge in reinforcement learning (RL), particularly in real-world applications such as autonomous driving, robotics, and healthcare. …