Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Safety
🛡️ AI Safety
alignment, AI reliability, guardrails, responsible AI
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
60
posts in
19.7
ms
Mechanistic
Interpretability
: The Key to Trusting Agentic
AI
🤝
AI Agents
Content type:
Discussion
bradenkelley.com
·
6d
6 days ago
Actions for Mechanistic Interpretability: The Key to Trusting Agentic AI
Risk Under Pressure: Compute-Aware Evaluation of
Adversarial
Robustness
in Language Models
🧠
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models
The Pope Found the Missing Layer in
AI
Alignment
🤖
AI Engineering
Content type:
Blog
chrisperkins505.medium.com
·
17h
17 hours ago
Actions for The Pope Found the Missing Layer in AI Alignment
The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably
🤝
AI Agents
lesswrong.com
·
2d
2 days ago
Actions for The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably
[Recorded talk] "
AI
Alignment
Versus
AI
Ethical Treatment: 10 Challenges"
🤖
AI Engineering
Content type:
Blog
meditationsondigitalminds.substack.com
·
3d
3 days ago
·
Substack
Actions for [Recorded talk] "AI Alignment Versus AI Ethical Treatment: 10 Challenges"
Anthropic’s Bet: Interview with Dario Amodei
🤝
AI Agents
4sysops.com
·
1d
1 day ago
Actions for Anthropic’s Bet: Interview with Dario Amodei
VFUSE: Virulent Feature Understanding with Sparse autoEncoders
🧠
LLMs
Content type:
Academic
biorxiv.org
·
1d
1 day ago
Actions for VFUSE: Virulent Feature Understanding with Sparse autoEncoders
Criti-hyping is the best thing that happened to Big Tech
🕸️
Distributed Systems
reveriesofahuman.com
·
3d
3 days ago
Actions for Criti-hyping is the best thing that happened to Big Tech
Less-relevant results
Adam Smith's Creation of a "Large Model" - 36 Kr
🤝
AI Agents
eu.36kr.com
·
16h
16 hours ago
Actions for Adam Smith's Creation of a "Large Model" - 36 Kr
Guardian Angels: LLM Personalization for Productivity and Security
🧠
LLMs
3
sources covering this post
gwern.net
·
5d
5 days ago
·
Hacker News
·
Cited by 3 articles
Actions for Guardian Angels: LLM Personalization for Productivity and Security
AI
Will Not Start a Nuclear War, but Humans Might: Conclusions and Policy Recommendations The notion that
AI
could start a nuclear war may be attention-grabbing...
🤖
AI Engineering
ai-frontiers.org
·
2d
2 days ago
Actions for AI Will Not Start a Nuclear War, but Humans Might: Conclusions and Policy Recommendations The notion that AI could start a nuclear war may be attention-grabbing...
The crucial human component in computing and
AI
🤖
AI Engineering
Content type:
Academic
news.mit.edu
·
6d
6 days ago
Actions for The crucial human component in computing and AI
Solsong Chord Updates
🔍
RAG
jefftk.com
·
2d
2 days ago
Actions for Solsong Chord Updates
Cisco
AI
Defense Policy Studio: Turning Unwritten Policy into Adaptive
AI
Guardrails
🧠
LLMs
Content type:
Blog
blogs.cisco.com
·
1d
1 day ago
Actions for Cisco AI Defense Policy Studio: Turning Unwritten Policy into Adaptive AI Guardrails
Neglected Basics of
AI
Alignment
🧠
LLMs
lesswrong.com
·
5d
5 days ago
Actions for Neglected Basics of AI Alignment
ERTS:
Adversarial
Robustness
Testing of Ethical
AI
via Semantic Perturbation in a Bounded Consequence Space
🧠
LLMs
Content type:
Academic
arxiv.org
·
15h
15 hours ago
Actions for ERTS: Adversarial Robustness Testing of Ethical AI via Semantic Perturbation in a Bounded Consequence Space
Designer babies. Self-improving
AI
. Are we ready for either?
🧠
LLMs
Content type:
News
vox.com
·
2d
2 days ago
Actions for Designer babies. Self-improving AI. Are we ready for either?
Is the Space Pope Reptilian?
🧠
LLMs
Content type:
News
tearsinrain.ai
·
2d
2 days ago
·
Hacker News
Actions for Is the Space Pope Reptilian?
Op
Ed: Consultant Tony O’Connor On The Agentic Trojan Horse
🤝
AI Agents
thecompanydime.com
·
4d
4 days ago
Actions for Op Ed: Consultant Tony O’Connor On The Agentic Trojan Horse
OpenClaw Won: How Big Tech Adopted the
AI
Agent
🤝
AI Agents
thelettertwo.com
·
4d
4 days ago
Actions for OpenClaw Won: How Big Tech Adopted the AI Agent
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help