AI Safety
alignment, RLHF, safety, interpretability
Who Elected Anthropic?
🤖 AI Engineering · Content type: Blog · vizierprime.substack.com · 6 days ago · Substack
RiskNet: A large-scale dataset of AI risk incidents from news with alignment and multi-dimensional annotations
🤖 AI Engineering · Content type: Academic · arxiv.org · 1 day ago
Three types of model organism
🎯 Reinforcement Learning · lesswrong.com · 11 hours ago
Anthropic urges US to require safety tests for most capable AI models
🔮 Multimodal AI · channelnewsasia.com · 57 minutes ago
Claude Fable 5: Anthropic releases a 'safe' version of Claude Mythos
🤖 AI Engineering · Content type: News · mashable.com · 1 day ago
AI giant says its own models could soon improve themselves — and now it wants a global pause
🤖 AI Engineering · thecooldown.com · 7 hours ago
Anthropic urges ‘temporary pause’ on AI development to discuss risks
🤖 Robotics · Content type: News · theguardian.com · 5 days ago · Hacker News
Anthropic releases Mythos-derived model with cyber guardrails
🤖 AI Engineering · metacurity.com · 6 hours ago
AI, at a Crossroads
🤖 AI Engineering · Content type: News, Blog · edgyoptimist.substack.com · 1 day ago · Substack
Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits capabilities for AI researchers and developers
🤖 AI Engineering · Content type: News · fortune.com · 2 hours ago
Claude Fable 5 and new AI safety fables
🧠 LLM Research · Content type: News · interconnects.ai · 21 hours ago · Hacker News
Anthropic proposes global development pause to mitigate recursive AI risks
🤖 Robotics · 4sysops.com · 5 days ago
Mythos and the Adolescence of AI Policy
🤖 AI Engineering · Content type: News · luizasnewsletter.com · 2 days ago
Anthropic's Model Naming, Extrapolated
🤖 AI Engineering · samwilkinson.io · 23 hours ago · Hacker News
Anthropic releases a version of its vaunted Mythos model to developers
🤖 AI Engineering · fastcompany.com · 1 day ago
Germany's National Security Council greenights an AI Safety Institute modeled after the UK's AISI
🤖 AI Engineering · the-decoder.com · 8 hours ago
Anthropic Scared, Calls for Global Freeze on AI Advances
🤖 Robotics · futurism.com · 5 days ago
Advanced AI Safety Addendum
🤖 AI Engineering · cloud.google.com · 1 day ago · Hacker News
Musk's xAI accused of illegally firing engineer who raised safety concerns
🤖 AI Engineering · Content type: News · ca.finance.yahoo.com · 3 hours ago
What the Claude Is Going on with Anthropic?
🧠 LLM Research · whytryai.com · 6 days ago