Is gzip a language model? (opens in new tab)

Covered by tldr.tech, robinpie.neocities.orgDiscussed on Hacker News, Hacker News, and Lobsters

A while back I wrote about , where I generated Shakespeare with an unbounded n-gram model: no weights, no training, just counting. Fortuitously, I came across the paper , which mentioned the compression–prediction equivalence: every prediction model is inherently a compressor, and all compression algorithms are prediction models. This led to the natural question: can gzip do language modeling?1 No neural network, no learned parameters, nothing. Just the compressor that ships with your operati...

Read the original article

Sign in to keep reading the full article.

Sign Up Log In

Covered in 2 articles

tldr.tech·

Agentic coding for all 🏟, converse more 🗣, building AI-native startups 🛠

robinpie.neocities.org·

Playing with the language modeling abilities of gzip

Discussed on Hacker News