Counterfactual Simulation Training for Chain-of-Thought Faithfulness
arXiv:2602.20710v1
Abstract: Inspecting Chain-of-Thought reasoning is among the most common means of understanding why an LLM produced its output. But well-known problems …
Peter Hase, Christopher Potts