Skip to main content

All Articles

Articles

Academic · 1 min

CAMEL: Confidence-Gated Reflection for Reward Modeling

arXiv:2602.20670v1 Announce Type: new Abstract: Reward models play a fundamental role in aligning large language models with human preferences. Existing methods predominantly follow two paradigms: …

Zirui Zhu, Hailun Xu, Yang Luo, Yong Liu, Kanchan Sarkar, Kun Xu, Yang You
9 views