SCOPE: Selective Conformal Optimized Pairwise LLM Judging
arXiv:2602.13110v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used as judges to replace costly human preference labels in pairwise evaluation. Despite their …
Sher Badshah, Ali Emami, Hassan Sajjad
3 views