← heapsort
RESEARCH27

Learning Transferable Latent User Preferences for Human-Aligned Decision Making

arXiv CS.AIΒ·May 14, 2026

This paper introduces CLIPR, a framework designed to enable Large Language Models (LLMs) to make human-aligned decisions by inferring latent user preferences from limited interactions. It addresses the challenge of LLMs struggling with human alignment and the limitations of existing approaches in generalizing preferences across tasks.

Read original β†—