Academic

EXACT: Explicit Attribute-Guided Decoding-Time Personalization

arXiv:2602.17695v1 Announce Type: cross Abstract: Achieving personalized alignment requires adapting large language models to each user's evolving context. While decoding-time personalization offers a scalable alternative to training-time methods, existing methods largely rely on implicit, less interpretable preference representations and impose a rigid, context-agnostic user representation, failing to account for how preferences shift across prompts. We introduce EXACT, a new decoding-time personalization that aligns generation with limited pairwise preference feedback using a predefined set of interpretable attributes. EXACT first identifies user-specific attribute subsets by maximizing the likelihood of preferred responses in the offline stage. Then, for online inference, EXACT retrieves the most semantically relevant attributes for an incoming prompt and injects them into the context to steer generation. We establish theoretical approximation guarantees for the proposed algorithm

X
Xin Yu, Hanwen Xing, Lingzhou Xue
· · 1 min read · 10 views

arXiv:2602.17695v1 Announce Type: cross Abstract: Achieving personalized alignment requires adapting large language models to each user's evolving context. While decoding-time personalization offers a scalable alternative to training-time methods, existing methods largely rely on implicit, less interpretable preference representations and impose a rigid, context-agnostic user representation, failing to account for how preferences shift across prompts. We introduce EXACT, a new decoding-time personalization that aligns generation with limited pairwise preference feedback using a predefined set of interpretable attributes. EXACT first identifies user-specific attribute subsets by maximizing the likelihood of preferred responses in the offline stage. Then, for online inference, EXACT retrieves the most semantically relevant attributes for an incoming prompt and injects them into the context to steer generation. We establish theoretical approximation guarantees for the proposed algorithm under mild assumptions, and provably show that our similarity-based retrieval mechanism effectively mitigates contextual preference shifts, adapting to disparate tasks without pooling conflicting preferences. Extensive experiments on human-annotated preference datasets demonstrate that EXACT consistently outperforms strong baselines, including preference modeling accuracy and personalized generation quality.

Executive Summary

The article 'EXACT: Explicit Attribute-Guided Decoding-Time Personalization' introduces a novel method for decoding-time personalization in large language models. EXACT leverages a predefined set of interpretable attributes to align generation with user preferences. The method involves offline identification of user-specific attribute subsets and online retrieval of semantically relevant attributes for an incoming prompt. Theoretical approximation guarantees and experimental results demonstrate EXACT's effectiveness in adapting to disparate tasks and mitigating contextual preference shifts. This approach has significant implications for personalized language generation and preference modeling accuracy.

Key Points

  • EXACT is a decoding-time personalization method that leverages interpretable attributes for personalized language generation.
  • The method involves offline attribute subset identification and online attribute retrieval for an incoming prompt.
  • EXACT demonstrates effectiveness in adapting to disparate tasks and mitigating contextual preference shifts.

Merits

Strength in Theoretical Foundations

The article provides theoretical approximation guarantees for the proposed algorithm, establishing a solid foundation for its credibility.

Practical Performance

EXACT outperforms strong baselines in preference modeling accuracy and personalized generation quality, demonstrating its practical effectiveness.

Demerits

Limited Interpretability

While EXACT uses interpretable attributes, the article does not provide a clear understanding of how these attributes are selected and weighted, which may limit the model's interpretability.

Scalability Concerns

The offline attribute subset identification stage may become computationally expensive for large datasets, which could impact EXACT's scalability.

Expert Commentary

The introduction of EXACT marks a significant advancement in decoding-time personalization, offering a novel approach that leverages interpretable attributes for personalized language generation. While the article's theoretical foundations and practical performance are notable strengths, limitations in interpretability and scalability require further investigation. As the field continues to evolve, EXACT's contributions will be essential in shaping the future of personalized language generation and preference modeling.

Recommendations

  • Future research should explore the extension of EXACT to more complex preference modeling scenarios, such as multi-task learning and transfer learning.
  • Developers and practitioners should carefully evaluate EXACT's performance in real-world applications, considering factors such as data quality, attribute selection, and computational resources.

Sources