Skip to main content

Academic

Academic

Academic · 1 min

Factored Latent Action World Models

arXiv:2602.16229v1 Announce Type: new Abstract: Learning latent actions from action-free video has emerged as a powerful paradigm for scaling up controllable world model learning. Latent …

Zizhao Wang, Chang Shi, Jiaheng Hu, Kevin Rohling, Roberto Mart\'in-Mart\'in, Amy Zhang, Peter Stone
5 views
Academic · 1 min

Fast KV Compaction via Attention Matching

arXiv:2602.16284v1 Announce Type: new Abstract: Scaling language models to long contexts is often bottlenecked by the size of the key-value (KV) cache. In deployed settings, …

Adam Zweiger, Xinghong Fu, Han Guo, Yoon Kim
5 views
Academic · 1 min

*-PLUIE: Personalisable metric with Llm Used for Improved Evaluation

arXiv:2602.15778v1 Announce Type: new Abstract: Evaluating the quality of automatically generated text often relies on LLM-as-a-judge (LLM-judge) methods. While effective, these approaches are computationally expensive …

Quentin Lemesle, L\'eane Jourdan, Daisy Munson, Pierre Alain, Jonathan Chevelu, Arnaud Delhay, Damien Lolive
8 views
Academic · 1 min

Avey-B

arXiv:2602.15814v1 Announce Type: new Abstract: Compact pretrained bidirectional encoders remain the backbone of industrial NLP under tight compute and memory budgets. Their effectiveness stems from …

Devang Acharya, Mohammad Hammoud
7 views
Academic · 1 min

Seeing to Generalize: How Visual Data Corrects Binding Shortcuts

arXiv:2602.15183v1 Announce Type: cross Abstract: Vision Language Models (VLMs) are designed to extend Large Language Models (LLMs) with visual capabilities, yet in this work we …

Nicolas Buzeta, Felipe del Rio, Cristian Hinostroza, Denis Parra, Hans Lobel, Rodrigo Toro Icarte
18 views