Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Safety
🛡️ AI Safety
alignment, AI reliability, guardrails, responsible AI
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
61
posts in
5.5
ms
Mechanistic
Interpretability
: The Key to Trusting Agentic
AI
🤝
AI Agents
Content type:
Discussion
bradenkelley.com
·
6d
6 days ago
Actions for Mechanistic Interpretability: The Key to Trusting Agentic AI
Risk Under Pressure: Compute-Aware Evaluation of
Adversarial
Robustness
in Language Models
🧠
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models
The Pope Found the Missing Layer in
AI
Alignment
🤖
AI Engineering
Content type:
Blog
chrisperkins505.medium.com
·
19h
19 hours ago
Actions for The Pope Found the Missing Layer in AI Alignment
The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably
🤝
AI Agents
lesswrong.com
·
2d
2 days ago
Actions for The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably
[Recorded talk] "
AI
Alignment
Versus
AI
Ethical Treatment: 10 Challenges"
🤖
AI Engineering
Content type:
Blog
meditationsondigitalminds.substack.com
·
3d
3 days ago
·
Substack
Actions for [Recorded talk] "AI Alignment Versus AI Ethical Treatment: 10 Challenges"
Anthropic’s Bet: Interview with Dario Amodei
🤝
AI Agents
4sysops.com
·
1d
1 day ago
Actions for Anthropic’s Bet: Interview with Dario Amodei
VFUSE: Virulent Feature Understanding with Sparse autoEncoders
🧠
LLMs
Content type:
Academic
biorxiv.org
·
1d
1 day ago
Actions for VFUSE: Virulent Feature Understanding with Sparse autoEncoders
Criti-hyping is the best thing that happened to Big Tech
🕸️
Distributed Systems
reveriesofahuman.com
·
3d
3 days ago
Actions for Criti-hyping is the best thing that happened to Big Tech
#5: Advertising is broken. It’s time to move your brand inside the model.
🔍
RAG
rhizome.org
·
4h
4 hours ago
Actions for #5: Advertising is broken. It’s time to move your brand inside the model.
Guardian Angels: LLM Personalization for Productivity and Security
🧠
LLMs
3
sources covering this post
gwern.net
·
5d
5 days ago
·
Hacker News
·
Cited by 3 articles
Actions for Guardian Angels: LLM Personalization for Productivity and Security
Less-relevant results
Adam Smith's Creation of a "Large Model" - 36 Kr
🤝
AI Agents
eu.36kr.com
·
19h
19 hours ago
Actions for Adam Smith's Creation of a "Large Model" - 36 Kr
AI
Will Not Start a Nuclear War, but Humans Might: Conclusions and Policy Recommendations The notion that
AI
could start a nuclear war may be attention-grabbing...
🤖
AI Engineering
ai-frontiers.org
·
2d
2 days ago
Actions for AI Will Not Start a Nuclear War, but Humans Might: Conclusions and Policy Recommendations The notion that AI could start a nuclear war may be attention-grabbing...
Solsong Chord Updates
🔍
RAG
jefftk.com
·
2d
2 days ago
Actions for Solsong Chord Updates
Neglected Basics of
AI
Alignment
🧠
LLMs
lesswrong.com
·
5d
5 days ago
Actions for Neglected Basics of AI Alignment
Cisco
AI
Defense Policy Studio: Turning Unwritten Policy into Adaptive
AI
Guardrails
🧠
LLMs
Content type:
Blog
blogs.cisco.com
·
1d
1 day ago
Actions for Cisco AI Defense Policy Studio: Turning Unwritten Policy into Adaptive AI Guardrails
ERTS:
Adversarial
Robustness
Testing of Ethical
AI
via Semantic Perturbation in a Bounded Consequence Space
🧠
LLMs
Content type:
Academic
arxiv.org
·
18h
18 hours ago
Actions for ERTS: Adversarial Robustness Testing of Ethical AI via Semantic Perturbation in a Bounded Consequence Space
Designer babies. Self-improving
AI
. Are we ready for either?
🧠
LLMs
Content type:
News
vox.com
·
2d
2 days ago
Actions for Designer babies. Self-improving AI. Are we ready for either?
Op
Ed: Consultant Tony O’Connor On The Agentic Trojan Horse
🤝
AI Agents
thecompanydime.com
·
4d
4 days ago
Actions for Op Ed: Consultant Tony O’Connor On The Agentic Trojan Horse
Is the Space Pope Reptilian?
🧠
LLMs
Content type:
News
tearsinrain.ai
·
2d
2 days ago
·
Hacker News
Actions for Is the Space Pope Reptilian?
Seven big ideas from 7x7
🤝
AI Agents
rhizome.org
·
4h
4 hours ago
Actions for Seven big ideas from 7x7
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help