On the "Induction Bias" in Sequence Models
arXiv:2602.18333v1 Announce Type: cross Abstract: Despite the remarkable practical success of transformer-based language models, recent work has raised concerns about their ability to perform state …
M. Reza Ebrahimi, Micha\"el Defferrard, Sunny Panchal, Roland Memisevic
3 views