Robust LLM Performance Certification via Constrained Maximum Likelihood Estimation
arXiv:2604.03257v1
Abstract: The ability to rigorously estimate the failure rates of large language models (LLMs) is a prerequisite for their safe deployment. …
Minghe Shen, Ananth Balashankar, Adam Fisch, David Madras, Miguel Rodrigues