Prompt-Dependent Ranking of Large Language Models with Uncertainty Quantification
arXiv:2603.03336v1 Announce Type: new Abstract: Rankings derived from pairwise comparisons are central to many economic and computational systems. In the context of large language models …
Angel Rodrigo Avelar Menendez, Yufeng Liu, Xiaowu Dai
10 views