Articles

Academic · 1 min

Disentangling Geometry, Performance, and Training in Language Models

arXiv:2602.20433v1. Abstract: Geometric properties of Transformer weights, particularly the unembedding matrix, have been widely useful in language model interpretability research. Yet, their …

Atharva Kulkarni, Jacob Mitchell Springer, Arjun Subramonian, Swabha Swayamdipta
Academic · 1 min

Stop-Think-AutoRegress: Language Modeling with Latent Diffusion Planning

arXiv:2602.20528v1. Abstract: The Stop-Think-AutoRegress Language Diffusion Model (STAR-LDM) integrates latent diffusion planning with autoregressive generation. Unlike conventional autoregressive language models limited to …

Justin Lovelace, Christian Belardi, Sofian Zalouk, Adhitya Polavaram, Srivatsa Kundurthy, Kilian Q. Weinberger
Academic · 1 min

Personal Information Parroting in Language Models

arXiv:2602.20580v1. Abstract: Modern language models (LM) are trained on large scrapes of the Web, containing millions of personal information (PI) instances, many …

Nishant Subramani, Kshitish Ghate, Mona Diab
Academic · 1 min

A Dynamic Survey of Soft Set Theory and Its Extensions

arXiv:2602.21268v1. Abstract: Soft set theory provides a direct framework for parameterized decision modeling by assigning to each attribute (parameter) a subset of …

Takaaki Fujita, Florentin Smarandache
Academic · 1 min

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

arXiv:2602.21534v1. Abstract: Agentic reinforcement learning (ARL) has rapidly gained attention as a promising paradigm for training agents to solve complex, multi-step interactive …

Xiaoxuan Wang, Han Zhang, Haixin Wang, Yidan Shi, Ruoyan Li, Kaiqiao Han, Chenyi Tong, Haoran Deng, Renliang Sun, Alexander Taylor, Yanqiao Zhu, Jason Cong, Yizhou Sun, Wei Wang