VRM: Teaching Reward Models to Understand Authentic Human Preferences
arXiv:2603.04974v1
Abstract: Large Language Models (LLMs) have achieved remarkable success across diverse natural language tasks, yet the reward models employed for aligning …
Biao Liu, Ning Xu, Junming Yang, Hao Xu, Xin Geng