Why Diffusion Language Models Struggle with Truly Parallel (Non-Autoregressive) Decoding?
arXiv:2602.23225v1 Announce Type: new Abstract: Diffusion Language Models (DLMs) are often advertised as enabling parallel token generation, yet practical fast DLMs frequently converge to left-to-right, …
Pengxiang Li, Dilxat Muhtar, Lu Yin, Tianlong Chen, Shiwei Liu
29 views