Semantic IDs for Generative Search and Recommendation
research.atspotify.com·9h

Large Language Model (LLM)-based generative frameworks are transforming how we approach both search and recommendation. Rather than maintaining separate, task-specific systems, these models provide a unified solution that simplifies design and can improve generalization across tasks. A central challenge, however, lies in how to represent items as discrete tokens that LLMs can process. Traditional approaches, such as using unique IDs, sequential IDs, or item titles, face limitations in scalability and in handling cold-start items.

Recent work has introduced Semantic IDs, where items are tokenized based on their embeddings. This allows similar items to share tokens, promoting better generalization. However, studies show that the effectiveness of Semantic IDs depends strongly on the …

Similar Posts

Loading similar posts...