PACED: Distillation at the Frontier of Student Competence
arXiv:2603.11178v1 Announce Type: new Abstract: Standard LLM distillation wastes compute on two fronts: problems the student has already mastered (near-zero gradients) and problems far beyond …
Yuanda Xu, Hejian Sang, Zhengze Zhou, Ran He, Zhipeng Wang
9 views