Large Language Model (LLM)-based generative frameworks are transforming how we approach both search and recommendation. Rather than maintaining separate, task-specific systems, these models provide a unified solution that simplifies design and can improve generalization across tasks. A central challenge, however, lies in how to represent items as discrete tokens that LLMs can process. Traditional approaches, such as using unique IDs, sequential IDs, or item titles, face limitations in scalability and in handling cold-start items.

Recent work has introduced Semantic IDs, where items are tokenized based on their embeddings. This allows similar items to share tokens, promoting better generalization. However, studies show that the effectiveness of Semantic IDs depends strongly on the …

Similar Posts

Loading similar posts...

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help