Interpretability

Feeds to Scour
SubscribedAll
Scoured 72 posts in 11.5 ms

Subspace-Aware Sparse Autoencoders for Effective Mechanistic Interpretability

 λType Theory  Content type: Academic
arxiv.org·

Kim Soo-Hyun Returns To Work? Actor To Shoot For Commercial Amid Kim Sae-Ron Controversy

 📡Information Theory  Content type: News
in.mashable.com·

[Paper] Dictionary Learning Identifiability for Understanding SAEs

 📡Information Theory
lesswrong.com·
Less-relevant results

Korean actor Kim Soo-hyun to shoot first ad campaign since controversy amid advertiser lawsuits

 📡Information Theory

Mechanistic Interpretability: The Key to Trusting Agentic AI

 🔬Philosophy of Science  Content type: Discussion
bradenkelley.com·

Asia faces risks of economic spillover from Iran and AI disinformation

 🌀Complexity Science  Content type: News
asia.nikkei.com·

VFUSE: Virulent Feature Understanding with Sparse autoEncoders

 📡Information Theory  Content type: Academic
arxiv.org·

Compositional and interpretable representation of histology using AI foundation models and sparse autoencoders

 λType Theory  Content type: Academic
biorxiv.org·

A Geometric View for Understanding Concept Learning and Neuron Interpretation in Sparse Autoencoders

 📡Information Theory  Content type: Academic
arxiv.org·

Two UWM Baja cars, one banner season for engineering students

 🔗Interdisciplinary  Content type: Academic
uwm.edu·

Playing with Vision Embeddings

 📡Information Theory

Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders

 ⚙️Compilers  Content type: Academic
arxiv.org·

Viewing the ESEAP Conference Through the Eyes of People with Neurodiversity

 🧠Cognitive Science
diff.wikimedia.org·

FoldSAE: Learning to Steer Protein Folding Through Sparse Representations

 λType Theory  Content type: Academic
arxiv.org·

How to drive automotive technology innovation during China’s 15th Five-Year Plan period

 🔬Philosophy of Science
autonews.gasgoo.com·

Interactions Between Crosscoder Features: A Compact Proofs Perspective

 λType Theory  Content type: Academic
arxiv.org·

SAE It Across Models: Explaining Features With Foreign NLA Verbalizers

 📡Information Theory
lesswrong.com·

ALPHA DRIVE ONE And Izna Goes Global In Partnership With REPUBLIC

 🕸️Network Theory
forbes.com·

Pre-Intervention Prediction of Sparse Autoencoder Steering Side Effects

 📡Information Theory  Content type: Academic
arxiv.org·

BioByte 162: The Hype of Virtual Cells, ESMC's AlphaFold3-Like Performance, and the Prediction of Antibody Non-Specificity

 λType Theory  Content type: Blog

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help