The Sequence AI of the Week #859: Reading Claude’s Mind in English: A Note on Natural Language Autoencoders (opens in new tab)

Discussed on Substack

Anthropic's fascinating new papers for the future of AI interpretability.

Read the original article

Sign in to keep reading the full article.