Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Interpretability
🔎 AI Interpretability
mechanistic interpretability, explainable AI, XAI, saliency
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
76
posts in
10.2
ms
🎮
Reinforcement Learning
arXiv
·
2d
2 days ago
Themis: An
explainable
AI-enabled
framework for Reinforcement Learning with Human Feedback
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Themis: An explainable AI-enabled framework for Reinforcement Learning with Human Feedback
🧠
Transformer Architecture
arXiv
·
11h
11 hours ago
Cascaded Multi-Granularity Pruning for On-Device LLM Inference in Industrial IoT
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Cascaded Multi-Granularity Pruning for On-Device LLM Inference in Industrial IoT
🔍
Interpretability
arXiv
·
2d
2 days ago
Evaluating the
Interpretability
of
Sparse
Autoencoders
with Concept Annotations
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Evaluating the Interpretability of Sparse Autoencoders with Concept Annotations
🤖
人工智能
arXiv
·
1d
1 day ago
What's in an Earth Embedding? An
Explainability
Analysis
of Location Encoders
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for What's in an Earth Embedding? An Explainability Analysis of Location Encoders
🔍
Interpretability
arXiv
·
2d
2 days ago
Ensemble
Feature
Selection and Harris Hawks Optimization for
Explainable
Mental
Health
Risk Prediction in Female Sex Workers
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Ensemble Feature Selection and Harris Hawks Optimization for Explainable Mental Health Risk Prediction in Female Sex Workers
🤖
AI
arXiv
·
3d
3 days ago
Asset Pricing in Pre-trained
Transformer
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Asset Pricing in Pre-trained Transformer
🤖
人工智能
arXiv
·
2d
2 days ago
Similarity of
Neural
Network
Representations in Superposition
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Similarity of Neural Network Representations in Superposition
🔍
Interpretability
arXiv
·
3d
3 days ago
A Differentiable Atari VCS:A Complex, Fully Known Ground Truth for
Explainable
AI
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for A Differentiable Atari VCS:A Complex, Fully Known Ground Truth for Explainable AI
🔬
ML Research
arXiv
·
11h
11 hours ago
Localizing RL-Induced Tool Use to a Single Crosscoder
Feature
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Localizing RL-Induced Tool Use to a Single Crosscoder Feature
🔬
AI Research
arXiv
·
11h
11 hours ago
Refusal Lives Downstream of Persona in Chat Models
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Refusal Lives Downstream of Persona in Chat Models
🗣️
Large Language Models
arXiv
·
2d
2 days ago
Sentence-Level Contextual Entrainment in Large Language Models
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Sentence-Level Contextual Entrainment in Large Language Models
🔍
Interpretability
arXiv
·
3d
3 days ago
Towards Transparent Mental
Health
Insights: An
Explainable
AI
Model for Career-Related Depression and Anxiety Among University Students Using Structured Data
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Towards Transparent Mental Health Insights: An Explainable AI Model for Career-Related Depression and Anxiety Among University Students Using Structured Data
🧠
LLM Research
arXiv
·
3d
3 days ago
Beyond Importance: Interchange-Sobol Sensitivity Reveals Task-Specific Content Channels in
Transformer
Components
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Beyond Importance: Interchange-Sobol Sensitivity Reveals Task-Specific Content Channels in Transformer Components
🔍
Interpretability
arXiv
·
11h
11 hours ago
From Weights to
Features
: SAE-Guided Activation Regularization for LLM Continual Learning
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for From Weights to Features: SAE-Guided Activation Regularization for LLM Continual Learning
🔬
ML Research
arXiv
·
3d
3 days ago
Few-Shot Hyperspectral Aphid Detection via FastGAN Synthetic Data Generation,
Transformer-Based
Classification and
Explainable
AI
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Few-Shot Hyperspectral Aphid Detection via FastGAN Synthetic Data Generation, Transformer-Based Classification and Explainable AI
⚡
Transformers
arXiv
·
3d
3 days ago
Grouped Query Experts: Mixture-of-Experts on GQA
Self-Attention
Covered by
ai-brief.liziran.com
,
Turing Post
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention
⚡
LLM Optimization
arXiv
·
2d
2 days ago
CompressKV: Semantic-Retrieval-Guided KV-Cache Compression for Resource-Efficient Long-Context LLM Inference
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for CompressKV: Semantic-Retrieval-Guided KV-Cache Compression for Resource-Efficient Long-Context LLM Inference
🧠
LLM Training
arXiv
·
1d
1 day ago
Perfect Detection, Failed Control: The Geometry of Knowing vs. Steering in Language Models
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Perfect Detection, Failed Control: The Geometry of Knowing vs. Steering in Language Models
🤖
人工智能
arXiv
·
3d
3 days ago
Explanations for Automatic Speech Recognition
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Explanations for Automatic Speech Recognition
🤖
人工智能
arXiv
·
3d
3 days ago
Extraction and
Analysis
of Multimodal Concepts in Vision Language Models through
Sparse
Autoencoders
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Extraction and Analysis of Multimodal Concepts in Vision Language Models through Sparse Autoencoders
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report