April 26, 2025

LoRe: Low-Rank Reward Modeling for Personalized LLMs

10 minutes

This paper introduces LoRe, a Low-Rank Reward Modeling framework for personalizing large language models (LLMs). Instead of fitting a separate reward model for each user, LoRe learns a low-dimensional space of reward functions shared across users. Each user's preferences are then modeled as a weighted combination of these basis reward functions, which allows efficient adaptation and generalization to new users with limited data. Compared with existing personalization techniques, this avoids rigid user categorizations and the need for extensive per-user data, improving the alignment of LLMs with diverse human preferences. LoRe also integrates with multi-objective alignment frameworks for personalized response generation.
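The core idea in the description (a user's reward as a weighted combination of shared basis rewards, with only the weights fitted for a new user) can be illustrated with a small sketch. This is not the paper's implementation: the function names, the synthetic data, the Bradley-Terry preference likelihood, and the plain gradient-ascent fit are all assumptions made for illustration, and the basis reward values are random numbers rather than outputs of a learned model.

```python
import numpy as np

def user_reward(basis_rewards, weights):
    """Reward for one user: weighted combination of K basis reward values."""
    return basis_rewards @ weights

def fit_user_weights(diffs, steps=500, lr=0.5):
    """Fit a new user's weights from a few pairwise preferences.

    diffs[i] = basis rewards of the chosen response minus those of the
    rejected response, shape (n, K). The shared basis stays fixed; only the
    K weights are learned. Assumes a Bradley-Terry likelihood (hypothetical
    choice for this sketch) maximized by gradient ascent.
    """
    n, k = diffs.shape
    w = np.zeros(k)
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(diffs @ w)))  # P(chosen is preferred)
        w += lr * diffs.T @ (1.0 - p) / n       # log-likelihood gradient
    return w

# Synthetic example: K = 3 basis rewards, 20 comparisons from one new user.
rng = np.random.default_rng(0)
K = 3
true_w = np.array([0.7, 0.2, 0.1])            # hypothetical user weights
diffs = rng.normal(size=(20, K)) - rng.normal(size=(20, K))
# Orient each pair so the "chosen" response is the one this user prefers.
sign = np.where(diffs @ true_w >= 0, 1.0, -1.0)
diffs = diffs * sign[:, None]

w_hat = fit_user_weights(diffs)
accuracy = np.mean(diffs @ w_hat > 0)         # agreement on the training pairs
```

Because only K weights are estimated per user, a handful of comparisons can be enough to place a new user in the shared reward space, which is the efficiency argument the description makes.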

By Enoch H. Kang
