Mitigating Shortcut Reasoning in Language Models: A Gradient-Aware Training Approach
arXiv:2603.20899v1
Abstract: Large language models exhibit strong reasoning capabilities, yet often rely on shortcuts such as surface pattern matching and answer memorization …
Hongyu Cao, Kunpeng Liu, Dongjie Wang, Yanjie Fu