The Sequence AI of the Week #859: Reading Claude’s Mind in English: A Note on Natural Language Autoencoders
Anthropic's fascinating new papers for the future of AI interpretability.