Characterizing Memorization in Diffusion Language Models: Generalized Extraction and Sampling Effects
arXiv:2603.02333v1
Abstract: Autoregressive language models (ARMs) have been shown to memorize and occasionally reproduce training data verbatim, raising concerns about privacy and …
Xiaoyu Luo, Wenrui Yu, Qiongxiu Li, Johannes Bjerva