Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control
arXiv:2603.09221v1 Announce Type: new Abstract: Associative memory has long underpinned the design of sequential models. Beyond recall, humans reason by projecting future states and selecting …
Peihao Wang, Shan Yang, Xijun Wang, Tesi Xiao, Xin Liu, Changlong Yu, Yu Lou, Pan Li, Zhangyang Wang, Ming Lin, Ren\'e Vidal
15 views