ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification
arXiv:2602.18447v1 Announce Type: new Abstract: Chain-of-Thought reasoning significantly improves the performance of large language models on complex tasks, but incurs high inference latency due to …
Siran Liu, Cyril Y. He
4 views