Controllable and explainable personality sliders for LLMs at inference time
arXiv:2603.03326v1 Announce Type: cross Abstract: Aligning Large Language Models (LLMs) with specific personas typically relies on expensive and monolithic Supervised Fine-Tuning (SFT) or RLHF. While …
Florian Hoppe, David Khachaturov, Robert Mullins, Mark Huasong Meng
4 views