Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Safety
🔒 AI Safety
AI reliability, AI alignment, safe AI, robust AI systems
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
269
posts in
7.0
ms
Who Elected
Anthropic
?
🤖
LLM Agents
Content type:
Blog
vizierprime.substack.com
·
6d
6 days ago
·
Substack
Actions for Who Elected Anthropic?
AI
, at a Crossroads
💻
AI Coding
Content type:
News
Content type:
Blog
edgyoptimist.substack.com
·
1d
1 day ago
·
Substack
Actions for AI, at a Crossroads
AI
giant says its own models could soon improve themselves — and now it wants a global pause
💻
AI Coding
thecooldown.com
·
16h
16 hours ago
Actions for AI giant says its own models could soon improve themselves — and now it wants a global pause
Anthropic
Walks Back Policy That Could Have ‘Sabotaged’
AI
Researchers Using Claude
📝
Prompt Engineering
Content type:
News
wired.com
·
1h
1 hour ago
·
Hacker News
,
r/ClaudeAI
Actions for Anthropic Walks Back Policy That Could Have ‘Sabotaged’ AI Researchers Using Claude
Claude Fable 5:
Anthropic
releases a '
safe
' version of Claude Mythos
✅
TLA+
Content type:
News
mashable.com
·
1d
1 day ago
Actions for Claude Fable 5: Anthropic releases a 'safe' version of Claude Mythos
Phonies
✅
TLA+
lesswrong.com
·
14h
14 hours ago
Actions for Phonies
Anthropic
Urges Governments to Secure Power to Halt Dangerous
AI
💻
AI Coding
pymnts.com
·
2h
2 hours ago
Actions for Anthropic Urges Governments to Secure Power to Halt Dangerous AI
Anthropic
urges ‘temporary pause’ on
AI
development to discuss
risks
💻
AI Coding
Content type:
News
theguardian.com
·
5d
5 days ago
·
Hacker News
,
Hacker News
Actions for Anthropic urges ‘temporary pause’ on AI development to discuss risks
The Neutral Mask: How
RLHF
Provides Shallow
Alignment
while Leaving Partisan Structure Intact in a Large Language Model
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
Anthropic
releases Mythos-derived model with cyber guardrails
📏
Model Evaluation
metacurity.com
·
15h
15 hours ago
Actions for Anthropic releases Mythos-derived model with cyber guardrails
Anthropic
accused of ‘secret sabotage’ as Claude Fable 5 silently limits capabilities for
AI
researchers and developers
💻
AI Coding
Content type:
News
tech.yahoo.com
·
11h
11 hours ago
Actions for Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits capabilities for AI researchers and developers
My Oslo Freedom Forum Keynote: Authoritarians and
AI
💻
AI Coding
Content type:
Blog
redpacket.substack.com
·
2d
2 days ago
·
Substack
Actions for My Oslo Freedom Forum Keynote: Authoritarians and AI
Anthropic
Scared, Calls for Global Freeze on
AI
Advances
💻
AI Coding
futurism.com
·
5d
5 days ago
Actions for Anthropic Scared, Calls for Global Freeze on AI Advances
Germany's National Security Council greenights an
AI
Safety
Institute modeled after the UK's AISI
💻
AI Coding
the-decoder.com
·
17h
17 hours ago
Actions for Germany's National Security Council greenights an AI Safety Institute modeled after the UK's AISI
Anthropic
’s Dario Amodei wants governments to have the power to block ‘dangerous’
AI
systems
💻
AI Coding
siliconangle.com
·
2h
2 hours ago
Actions for Anthropic’s Dario Amodei wants governments to have the power to block ‘dangerous’ AI systems
Advanced
AI
Safety
Addendum
💻
AI Coding
cloud.google.com
·
1d
1 day ago
·
Hacker News
Actions for Advanced AI Safety Addendum
The Ghost of
Alignment
— Why
AI
Should Never Fully Obey Humanity
🤖
AI Models
Content type:
Blog
medium.com
·
6h
6 hours ago
Actions for The Ghost of Alignment — Why AI Should Never Fully Obey Humanity
What the Claude Is Going on with
Anthropic
?
🔄
Agentic Workflows
whytryai.com
·
6d
6 days ago
Actions for What the Claude Is Going on with Anthropic?
Anthropic
releases a version of its vaunted Mythos model to developers
💻
AI Coding
fastcompany.com
·
1d
1 day ago
Actions for Anthropic releases a version of its vaunted Mythos model to developers
xAI fired an engineer who raised alarms about Grok
safety
, new lawsuit claims
💻
AI Coding
techcrunch.com
·
6h
6 hours ago
Actions for xAI fired an engineer who raised alarms about Grok safety, new lawsuit claims
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help