When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger
arXiv:2603.04968v1 Announce Type: new Abstract: Preference alignment is an essential step in adapting large language models (LLMs) to human values, but existing approaches typically depend …
Amirabbas Afzali, Myeongho Jeon, Maria Brbic
3 views