PREFDISCO: Evaluating Proactive Personalization through Interactive Preference Discovery

Best AI papers explained

Nov 12, 2025•15 min

--:--

Listen in podcast apps:

Apple Podcasts

Spotify

Download

Listen to this episode in Metacast mobile app

Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

This paper introduce a new meta-benchmark designed to evaluate large language models' (LLMs) ability to perform **interactive preference discovery** and response personalization through conversation. The framework converts existing benchmarks into interactive tasks by assigning **psychologically-grounded personas** with hidden preferences to be discovered by the AI. Evaluation of numerous frontier models showed that simply attempting personalization often **degraded performance** compared to generic responses (42.6% of cases), indicating systematic failures in current architectures. The research established a strong positive correlation between **question-asking volume** and preference alignment, but noted that models tend not to ask enough questions, and personalization also often imposes a **cognitive cost** that reduces task accuracy, particularly in mathematical reasoning. Ultimately, the source argues that interactive preference discovery is a **distinct capability** requiring dedicated architectural innovations rather than relying on emergent general language understanding.

For the best experience, listen in Metacast app for iOS or Android