Token Prism
blog.scottlogic.com·13h

When working with agentic AI, tool calls and results abound. With tokens quickly mounting up, it's common to want to visualise them, and the obvious solution is a website like OpenAI's. However, using these with anything more than toy data raises serious data privacy concerns. Consequently, I set out to build my own more secure, offline solution.

Background

I ought to explain what a token is before anything else: a token is a portion of text, generally sub-word. Since neural networks much prefer working with numbers, each token is mapped to a distinct ID, and a sequence of these IDs is passed to a model for it to process.
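To make the text-to-ID mapping concrete, here is a minimal sketch in Python. The vocabulary, the IDs and the `encode`/`decode` helpers are all made up for illustration and are not taken from any real tokeniser. It also uses a simple greedy longest-match strategy, whereas real tokenisers typically learn their vocabulary from training data.

```python
# Toy vocabulary: hypothetical sub-word tokens and IDs, chosen only for illustration.
VOCAB = {"token": 0, "is": 1, "ation": 2, "er": 3, "s": 4, " ": 5}
UNKNOWN_ID = 6  # assumed fallback ID for characters the vocabulary cannot cover
ID_TO_TOKEN = {token_id: token for token, token_id in VOCAB.items()}


def encode(text: str) -> list[int]:
    """Split text into the longest known tokens, left to right, and return their IDs."""
    ids = []
    pos = 0
    while pos < len(text):
        # Try the longest candidate first, shrinking until a known token is found.
        for end in range(len(text), pos, -1):
            piece = text[pos:end]
            if piece in VOCAB:
                ids.append(VOCAB[piece])
                pos = end
                break
        else:
            # No known token starts here: emit the fallback ID and skip one character.
            ids.append(UNKNOWN_ID)
            pos += 1
    return ids


def decode(ids: list[int]) -> str:
    """Map IDs back to text; unknown IDs become '?'."""
    return "".join(ID_TO_TOKEN.get(token_id, "?") for token_id in ids)


print(encode("tokenisation"))          # [0, 1, 2]
print(decode(encode("tokenisers")))    # tokenisers
```

The model only ever sees the list of integers. The text pieces exist purely so that people can read the input and output, which is also why a visualiser for them is useful.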

When it comes to language models, tokenisers are usually trained alongside them. They look at the same training data, but inst…
