On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking
arXiv:2602.16849v1 Announce Type: new Abstract: We present a comprehensive analysis of how two-layer neural networks learn features to solve the modular addition task. Our work …
Jianliang He, Leda Wang, Siyu Chen, Zhuoran Yang
8 views