Reasoning Through Chess: How Reasoning Evolves from Data Through Fine-Tuning and Reinforcement Learning
arXiv:2604.05134v1 Announce Type: new Abstract: How can you get a language model to reason in a task it natively struggles with? We study how reasoning …
Lucas Dionisopoulos, Nicklas Majamaki, Prithviraj Ammanabrolu
4 views