Rescaling Confidence: What Scale Design Reveals About LLM Metacognition
arXiv:2603.09309v1 Announce Type: new Abstract: Verbalized confidence, in which LLMs report a numerical certainty score, is widely used to estimate uncertainty in black-box settings, yet …
Yuyang Dai
4 views