Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals
arXiv:2603.03242v1 Announce Type: new Abstract: Language models deployed in online communities must adapt to norms that vary across social, cultural, and domain-specific contexts. Prior alignment …
Patrick Gerard, Svitlana Volkova
4 views