Influence-Preserving Proxies for Gradient-Based Data Selection in LLM Fine-tuning
arXiv:2602.17835v1 Announce Type: new Abstract: Supervised fine-tuning (SFT) relies critically on selecting training data that most benefits a model's downstream performance. Gradient-based data selection methods …