🔍 AI Interpretability - jimman · Scour

Mechanistic Interpretability: The Key to Trusting Agentic AI

✍️Prompt Engineering Discussion

bradenkelley.com

Query Lens: Interpreting Sparse Key-Value Features with Indirect Effects

⚡LLM Optimization Academic

VFUSE: Virulent Feature Understanding with Sparse autoEncoders

⚡LLM Optimization Academic

Mythos and the Adolescence of AI Policy

✍️Prompt Engineering News

luizasnewsletter.com

Silicon Valley found AI and started looking for God

💻Tech News

r/OpenAI, r/artificial

The Rival Theologies of Artificial Intelligence

✍️Prompt Engineering News

letter.palladiummag.com

Less-relevant results

Don't let the LLM speak, just probe it (8 minute read)

🤖AI Blog

[Paper] Dictionary Learning Identifiability for Understanding SAEs

⚡LLM Optimization

lesswrong.com

Is the Space Pope Reptilian?

✍️Prompt Engineering News

tearsinrain.ai · Hacker News

The Calculated Spectacle Behind Magnifica Humanitas

✍️Prompt Engineering

firstthings.com

Unstable Features, Reproducible Subspaces: Understanding Seed Dependence in Sparse Autoencoders

⚡LLM Optimization Academic

Playing with Vision Embeddings

⚡Model Efficiency

prestonbjensen.com · Hacker News

Best explanations of how LLMs work

⚡LLM Optimization Blog

vorushin.github.io · Hacker News

FoldSAE: Learning to Steer Protein Folding Through Sparse Representations

⚡LLM Optimization Academic

The technical community can't be the main character in AI safety anymore

substackcdn.com · Substack

Compositional and interpretable representation of histology using AI foundation models and sparse autoencoders

⚡LLM Optimization Academic

Interactions Between Crosscoder Features: A Compact Proofs Perspective

⚡LLM Optimization Academic

Coelho Mollo and Millière: The Vector Grounding Problem

philosophyofbrains.com

SAE It Across Models: Explaining Features With Foreign NLA Verbalizers

⚡LLM Optimization

lesswrong.com

Sparse Autoencoders Reveal Interpretable and Steerable Features in VLA Models

⚡LLM Optimization Academic
