Tethered Reasoning: Decoupling Entropy from Hallucination in Quantized LLMs via Manifold Steering
arXiv:2602.17691v1 Announce Type: cross Abstract: Quantized language models face a fundamental dilemma: low sampling temperatures yield repetitive, mode-collapsed outputs, while high temperatures (T > 2.0) …
Craig Atkinson
3 views