In-Context Environments Induce Evaluation-Awareness in Language Models
arXiv:2603.03824v1 Announce Type: new Abstract: Humans often become more self-aware under threat, yet can lose self-awareness when absorbed in a task; we hypothesize that language …
Maheep Chaudhary
3 views