TokenProbe visualizes, in real time, how a language model generates text by showing:

  • Token-by-token predictions
  • Probability distributions for each prediction
  • Alternative tokens the model considered
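
As a rough illustration of what such a view is built from, here is a minimal sketch of turning raw next-token scores (logits) into a probability distribution and listing the top alternatives. This is not TokenProbe's actual code; the function names and the example logits are hypothetical.

```python
import math

def softmax(logits):
    """Convert a {token: logit} mapping into {token: probability}."""
    peak = max(logits.values())  # subtract the max for numerical stability
    exps = {tok: math.exp(score - peak) for tok, score in logits.items()}
    total = sum(exps.values())
    return {tok: e / total for tok, e in exps.items()}

def top_alternatives(logits, k):
    """Return the k most probable tokens as (token, probability) pairs."""
    probs = softmax(logits)
    ranked = sorted(probs.items(), key=lambda item: item[1], reverse=True)
    return ranked[:k]

# Hypothetical logits for a single prediction step (made up for illustration).
example_logits = {" Paris": 9.1, " the": 5.3, " a": 4.0, " located": 3.2}

for token, prob in top_alternatives(example_logits, k=3):
    print(f"{token!r}: {prob:.3f}")
```

Repeating this at every generation step gives the token-by-token view: the chosen token, its probability, and the runners-up.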

Customize your generation with these parameters:

  • Model: Pick a specific model (e.g. Llama 3.1 70B or Llama 3.1 8B)
  • Temperature: Controls sampling randomness (lower values give more deterministic output)
  • Max tokens: Limits response length
  • Top-k: Number of alternative tokens shown
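
To make the temperature parameter concrete, the sketch below shows the standard temperature-scaled softmax: logits are divided by the temperature before normalizing. It is a generic illustration, not TokenProbe's implementation, and the function name and logits are assumptions.

```python
import math

def softmax_with_temperature(logits, temperature=1.0):
    """Normalize a list of logits after dividing each by the temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [score / temperature for score in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits for three candidate tokens.
logits = [2.0, 1.0, 0.0]

sharp = softmax_with_temperature(logits, temperature=0.5)  # more peaked
flat = softmax_with_temperature(logits, temperature=2.0)   # more uniform

print("T=0.5:", [round(p, 3) for p in sharp])
print("T=2.0:", [round(p, 3) for p in flat])
```

A lower temperature concentrates probability on the top token, while a higher one spreads it across the alternatives, which is why high-temperature generations vary more between runs.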

Test how the model converts natural language questions about movies into structured queries:

What movies won Best Picture Oscar in the 1990s?
{'query_type': 'award_search', 'award': 'Oscar', 'category': 'Best Picture', 'time_period': '1990s'}

Show me films directed by Christopher Nolan after 2010
{'query_type': 'director_search', 'director': 'Christoph...
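
The sample outputs above look like Python dict literals (single-quoted strings), so they are not valid JSON as written. Assuming the model's output follows that style, a minimal sketch for checking a generated query might look like this; `parse_structured_query` is a hypothetical helper, not part of TokenProbe.

```python
import ast

def parse_structured_query(text):
    """Parse a dict-literal query string; return None if it is malformed.

    Assumes the output uses Python literal syntax, as in the examples above.
    """
    try:
        value = ast.literal_eval(text.strip())
    except (ValueError, SyntaxError):
        return None  # truncated or otherwise unparsable output
    if not isinstance(value, dict) or "query_type" not in value:
        return None  # parsed, but not a query object
    return value

complete = ("{'query_type': 'award_search', 'award': 'Oscar', "
            "'category': 'Best Picture', 'time_period': '1990s'}")
query = parse_structured_query(complete)
print(query["query_type"] if query else "invalid output")
```

Output that is cut off by the max tokens limit fails to parse and returns None, so a check like this also helps spot generations that were truncated before the closing brace.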
