🔒 Agentic Safety
Specific
AI agent constraints, tool-use safety, corrigibility
Scoured 199976 posts in 19.2 ms
Why Does Agentic Safety Fail to Generalize Across Tasks?
🛡️ AI Safety · arxiv.org · 3d
Empowerment, corrigibility, etc. are simple abstractions (of a messed-up ontology)
🔮 Perplexity · alignmentforum.org · 2d
A lack of introspective ability is not a lack of corrigibility
🔮 Perplexity · lesswrong.com · 16h
On-Policy Self-Evolution via Failure Trajectories for Agentic Safety Alignment
🛡️ AI Safety · arxiv.org · 1d
Empowerment, corrigibility, etc. are simple abstractions (of a messed-up ontology)
🔮 Perplexity · lesswrong.com · 2d