🔍 Interpretability - taylor · Scour

Mechanistic Interpretability: The Key to Trusting Agentic AI

🔬Mech Interp Discussion

bradenkelley.com

Closure-Validated Circuit Discovery in Attention Heads: Co-activation Proposes, Ablation Disposes

🔬Mech Interp Academic

Less-relevant results

Newman-Janis Algorithm from Taub-Newman-Unti-Tamburino Instantons

🔣Category Theory

Physicists create new family of Schrödinger-cat states

🏛️Vault Lighting

Compositional and interpretable representation of histology using AI foundation models and sparse autoencoders

🔬Mech Interp Academic

Physicists Built Quantum States So Strange They Only Existed In Theory, Until Now

🏛️Vault Lighting

studyfinds.com

Time–temperature superposition of silicone rubber embedded with irregular-shaped magnetic particles under different magnetic fields

🔬Mech Interp Academic

Casa Tlaloc / Lopez Gonzalez Studio

🏗️Architecture

archdaily.com

Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders

🔬Mech Interp Academic

[Paper] Dictionary Learning Identifiability for Understanding SAEs

🔬Mech Interp

lesswrong.com

VFUSE: Virulent Feature Understanding with Sparse autoEncoders

🔬Mech Interp Academic

Cisco sees quantum networking as the future of networking

🌐Distributed Systems

networkworld.com

Playing with Vision Embeddings

🔬Mech Interp

prestonbjensen.com · Hacker News

Query Lens: Interpreting Sparse Key-Value Features with Indirect Effects

🔬Mech Interp Academic

Participant Observation

theoffingmag.com

China unveils world’s first superfast quantum memory, paving way for practical computing

🐧Operating Systems Video News

Interactions Between Crosscoder Features: A Compact Proofs Perspective

🔬Mech Interp Academic

princezuda/-RequiemGPT-: Fully open source and open weights built and trained by fable five with one prompt. An experience in how AI actually works

🎲Procedural Generation Code

github.com · Hacker News

Shared Semantics, Divergent Mechanisms: Unsupervised Feature Discovery by Aligning Semantics and Mechanisms

🛡️AI Safety Academic

ORIGAMI: Orientation-Aware Graph Neural Network for Assessing Multimeric Interfaces of Protein Complex Structures

🔬Mech Interp Academic
