Do Personality Traits Interfere? Geometric Limitations of Steering in Large Language Models
arXiv:2602.15847v1 Announce Type: cross Abstract: Personality steering in large language models (LLMs) commonly relies on injecting trait-specific steering vectors, implicitly assuming that personality traits can …
Pranav Bhandari, Usman Naseem, Mehwish Nasim
7 views