Xingwu Chen, Zhanqiu Zhang, Yiwen Guo, Difan Zou

Articles by Xingwu Chen, Zhanqiu Zhang, Yiwen Guo, Difan Zou

Academic · 1 min

Breaking Contextual Inertia: Reinforcement Learning with Single-Turn Anchors for Stable Multi-Turn Interaction

arXiv:2603.04783v1. Abstract: While LLMs demonstrate strong reasoning capabilities when provided with full information in a single turn, they exhibit substantial vulnerability in …

25 views Mar 7

JCG, PC

HSOLLC Co., Ltd.