Certified Circuits: Stability Guarantees for Mechanistic Circuits
arXiv:2602.22968v1 Announce Type: new Abstract: Understanding how neural networks arrive at their predictions is essential for debugging, auditing, and deployment. Mechanistic interpretability pursues this goal …
Alaa Anani, Tobias Lorenz, Bernt Schiele, Mario Fritz, Jonas Fischer
4 views