Tag: cs.FL

#cs.FL

Academic · 1 min

Warm Starting State-Space Models with Automata Learning

arXiv:2603.05694v1 Announce Type: new Abstract: We prove that Moore machines can be exactly realized as state-space models (SSMs), establishing a formal correspondence between symbolic automata …

William Fishell, Sam Nicholas Kouteili, Mark Santolucito
16 views
Academic · 1 min

Continuous Diffusion Models Can Obey Formal Syntax

arXiv:2602.12468v1 Announce Type: new Abstract: Diffusion language models offer a promising alternative to autoregressive models due to their global, non-causal generation process, but their continuous …

Jinwoo Kim, Taylor Berg-Kirkpatrick, Loris D'Antoni
16 views
Academic · 1 min

Why Are Linear RNNs More Parallelizable?

arXiv:2603.03612v1 Announce Type: new Abstract: The community is increasingly exploring linear RNNs (LRNNs) as language models, motivated by their expressive power and parallelizability. While prior …

William Merrill, Hongjian Jiang, Yanhong Li, Ashish Sabharwal
17 views
Academic · 1 min

Length Generalization Bounds for Transformers

arXiv:2603.02238v1 Announce Type: new Abstract: Length generalization is a key property of a learning algorithm that enables it to make correct predictions on inputs of …

Andy Yang, Pascal Bergstr\"a{\ss}er, Georg Zetzsche, David Chiang, Anthony W. Lin
17 views