Large Language Models Should Learn Personalized Rather Than Aggregated Human Preferences (opens in new tab) 🗳️Social Choice Content type: Academic

arxiv.org··Cited by 1 article·Open original

Current approaches to aligning large language models (LLMs) aggregate diverse human preferences into a single reward signal, effectively optimizing for a hypothetical ``average user'' who represents no real person particularly well. This position paper argues that LLMs should learn personalized, individual preferences rather than aggregated ones. We show that aggregation masks critical information about preference diversity, individual values,...

Read the original article

Sign in to keep reading the full article.

Sign Up Log In

Cited by 1 article

In other languages

AI 연구진, 안전성 및 효율성 벤치마크 논문 발표

kite.kagi.com·