Think-While-Generating: On-the-Fly Reasoning for Personalized Long-Form Generation
arxiv.org·21h
🔨Compilers
Preview
Report Post

View PDF HTML (experimental)

Abstract:Preference alignment has enabled large language models (LLMs) to better reflect human expectations, but current methods mostly optimize for population-level preferences, overlooking individual users. Personalization is essential, yet early approaches-such as prompt customization or fine-tuning-struggle to reason over implicit preferences, limiting real-world effectiveness. Recent "think-then-generate" methods address this by reasoning before response generation. However, they face challenges in long-form generation: their static one-shot reasoning must capture all relevant information for the full response generation, making learning difficult and limiting adaptabili…

Similar Posts

Loading similar posts...