Rethinking Token Prediction: Tree-Structured Diffusion Language Model
arXiv:2604.03537v1 Announce Type: new Abstract: Discrete diffusion language models have emerged as a competitive alternative to auto-regressive language models, but training them efficiently under limited …
Zihao Wu, Haoming Yang, Juncheng Dong, Vahid Tarokh
5 views