Self-Play Only Evolves When Self-Synthetic Pipeline Ensures Learnable Information Gain
arXiv:2603.02218v1 Announce Type: cross Abstract: Large language models (LLMs) make it plausible to build systems that improve through self-evolving loops, but many existing proposals are …
Wei Liu, Siya Qi, Yali Du, Yulan He
13 views