ORACLE: Optimizing Reasoning Abilities of Large Language Models via Constraint-Led Synthetic Data Elicitation
arXiv:2603.21140v1 Announce Type: new Abstract: Training large language models (LLMs) with synthetic reasoning data has become a popular approach to enhancing their reasoning capabilities, while …
Zhuojie Yang, Wentao Wan, Keze Wang
10 views