🔬 Interpretability - sunzhongxiang · Scour

Mechanistic Interpretability: The Key to Trusting Agentic AI

🤖Agent Discussion

bradenkelley.com·

Query Lens: Interpreting Sparse Key-Value Features with Indirect Effects

🔍RAG Academic

rentruewang/inversql: Create SQL that match your selection (with explainable AI), not the other way around

🔍RAG Code

github.com··Hacker News

What shapes your power bill? Explainable AI outlines forecasts behind grid and price decisions

💾Memory Systems

techxplore.com·

How Does XAI Actually Work? A Look at SHAP and LIME in Cybersecurity

🎯Alignment Blog

Building MalTrace: A Behavioral Malware Analysis Pipeline with Explainable AI

🧠Cognitive Neurosciens for AI Blog

·

Compositional and interpretable representation of histology using AI foundation models and sparse autoencoders

🎨Multimodal AI Academic

AI Predicts Brain Tumor Molecular Subtypes in Twelve Minutes

🧠Neuroscience

neurosciencenews.com·

Playing with Vision Embeddings

🎨Multimodal AI

prestonbjensen.com··Hacker News

A Practical Guide to Assessing Agentic AI Companies for Enterprise Needs

netnewsledger.com·

[Paper] Dictionary Learning Identifiability for Understanding SAEs

💾Memory Systems

lesswrong.com·

One Lens, Many Worlds : A Capability-Typed Interface for World-Model Interpretability

💾Memory Systems Academic

Less-relevant results

Is the Space Pope Reptilian?

🎯Alignment News

tearsinrain.ai··Hacker News

DataXflowGen for GenAI-driven model generation

🎨Multimodal AI Academic

Anish-185/Production-Line-Performance-Checker

🎯Alignment Code

github.com··r/coding

Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders

🎨Multimodal AI Academic

BioByte 162: The Hype of Virtual Cells, ESMC's AlphaFold3-Like Performance, and the Prediction of Antibody Non-Specificity

🔍RAG Blog

decodingbio.substack.com··Substack

Why Digital Identity Fraud Is Becoming a Bigger Threat to Financial Services

globalbankingandfinance.com·

Trajectory Geometry of Transformer Representations Across Layers

🧠Neuroscience Academic

Coelho Mollo and Millière: The Vector Grounding Problem

🦾Embodied AI

philosophyofbrains.com·

Log in to enable infinite scrolling