Soham Gadgil, Chris Lin, Su-In Lee

Articles by Soham Gadgil, Chris Lin, Su-In Lee

Academic · 1 min

Where to Steer: Input-Dependent Layer Selection for Steering Improves LLM Alignment

arXiv:2604.03867v1 (announce type: new)

Abstract: Steering vectors have emerged as a lightweight and effective approach for aligning large language models (LLMs) at inference time, enabling …

49 views Apr 7
