QAQ: Bidirectional Semantic Coherence for Selecting High-Quality Synthetic Code Instructions
arXiv:2603.12165v1 Announce Type: new Abstract: Synthetic data has become essential for training code generation models, yet it introduces significant noise and hallucinations that are difficult …
Jiayin Lei, Ming Ma, Yunxi Duan, Chenxi Li, Tianming Yang
9 views