Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Safety
🛡️ AI Safety
alignment, RLHF, safety, interpretability
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
282
posts in
5.5
ms
Who Elected
Anthropic
?
🤖
AI Engineering
Content type:
Blog
vizierprime.substack.com
·
5d
5 days ago
·
Substack
Actions for Who Elected Anthropic?
RiskNet
: A large-scale dataset of
AI
risk incidents from news with
alignment
and multi-dimensional annotations
🤖
AI Engineering
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for RiskNet: A large-scale dataset of AI risk incidents from news with alignment and multi-dimensional annotations
Claude Fable 5:
Anthropic
releases a '
safe
' version of Claude Mythos
🤖
AI Engineering
Content type:
News
mashable.com
·
14h
14 hours ago
Actions for Claude Fable 5: Anthropic releases a 'safe' version of Claude Mythos
Three types of
model
organism
🧠
LLM Research
lesswrong.com
·
33m
33 minutes ago
Actions for Three types of model organism
AI
, at a Crossroads
🤖
AI Engineering
Content type:
News
Content type:
Blog
edgyoptimist.substack.com
·
16h
16 hours ago
·
Substack
Actions for AI, at a Crossroads
Mythos and the Adolescence of
AI
Policy
🤖
AI Engineering
Content type:
News
luizasnewsletter.com
·
2d
2 days ago
Actions for Mythos and the Adolescence of AI Policy
Anthropic
urges ‘temporary pause’ on
AI
development to discuss
risks
🤖
Robotics
Content type:
News
theguardian.com
·
4d
4 days ago
·
Hacker News
,
Hacker News
Actions for Anthropic urges ‘temporary pause’ on AI development to discuss risks
Anthropic
releases a version of its vaunted Mythos
model
to developers
🤖
AI Engineering
fastcompany.com
·
15h
15 hours ago
Actions for Anthropic releases a version of its vaunted Mythos model to developers
Meta Security Failures, Agent Adoption, &
AI
Slowdown Push
🤖
AI Engineering
briefing.forwardfuture.ai
·
1d
1 day ago
Actions for Meta Security Failures, Agent Adoption, & AI Slowdown Push
Claude Fable 5 and new
AI
safety
fables
🧠
LLM Research
Content type:
News
interconnects.ai
·
10h
10 hours ago
·
Hacker News
Actions for Claude Fable 5 and new AI safety fables
Anthropic
proposes global development pause to mitigate recursive
AI
risks
🤖
Robotics
4sysops.com
·
4d
4 days ago
Actions for Anthropic proposes global development pause to mitigate recursive AI risks
Clearing Up The Confusion About What
Anthropic
Really Said On Globally Pausing The Unrelenting Race Toward
AI
That Builds
AI
🤖
AI Engineering
forbes.com
·
2d
2 days ago
Actions for Clearing Up The Confusion About What Anthropic Really Said On Globally Pausing The Unrelenting Race Toward AI That Builds AI
Advanced
AI
Safety
Addendum
🤖
AI Engineering
cloud.google.com
·
13h
13 hours ago
·
Hacker News
Actions for Advanced AI Safety Addendum
My Oslo Freedom Forum Keynote: Authoritarians and
AI
🤖
AI Engineering
Content type:
Blog
redpacket.substack.com
·
1d
1 day ago
·
Substack
Actions for My Oslo Freedom Forum Keynote: Authoritarians and AI
As SpaceX, OpenAI and
Anthropic
plan blockbuster launches, will it make
AI
giants more accountable?
🤖
AI Engineering
theconversation.com
·
13h
13 hours ago
Actions for As SpaceX, OpenAI and Anthropic plan blockbuster launches, will it make AI giants more accountable?
Anthropic
Scared, Calls for Global Freeze on
AI
Advances
🤖
Robotics
futurism.com
·
4d
4 days ago
Actions for Anthropic Scared, Calls for Global Freeze on AI Advances
Anthropic
Tries to Revive the “
AI
Pause”
🔮
Multimodal AI
internetgovernance.org
·
2d
2 days ago
Actions for Anthropic Tries to Revive the “AI Pause”
DTEX adds
AI
Risk
Management to track how agents and employees use
AI
🤖
AI Engineering
siliconangle.com
·
20h
20 hours ago
Actions for DTEX adds AI Risk Management to track how agents and employees use AI
What the Claude Is Going on with
Anthropic
?
🧠
LLM Research
whytryai.com
·
5d
5 days ago
Actions for What the Claude Is Going on with Anthropic?
Anthropic
May Be Reconsidering the Pace of
AI
🤖
Robotics
thinkingabout.ai
·
16h
16 hours ago
Actions for Anthropic May Be Reconsidering the Pace of AI
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help