AI Safety
AI reliability, AI alignment, safe AI, robust AI systems
Scoured 271 posts in 7.6 ms
Who Elected Anthropic?
🤖 LLM Agents · Content type: Blog
vizierprime.substack.com · 6 days ago · Substack
AI, at a Crossroads
💻 AI Coding · Content type: News, Blog
edgyoptimist.substack.com · 1 day ago · Substack
AI giant says its own models could soon improve themselves — and now it wants a global pause
💻 AI Coding
thecooldown.com · 13 hours ago
Thoughts on Claude Fable's silent safeguards
🧠 LLMs
lesswrong.com · 2 hours ago
Claude Fable 5: Anthropic releases a 'safe' version of Claude Mythos
✅ TLA+ · Content type: News
mashable.com · 1 day ago
Anthropic releases Mythos-derived model with cyber guardrails
📏 Model Evaluation
metacurity.com · 12 hours ago
Anthropic urges ‘temporary pause’ on AI development to discuss risks
💻 AI Coding · Content type: News
theguardian.com · 5 days ago · Hacker News
Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits capabilities for AI researchers and developers
💻 AI Coding · Content type: News
tech.yahoo.com · 8 hours ago
The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
🧠 LLMs · Content type: Academic
arxiv.org · 1 day ago
The Ghost of Alignment — Why AI Should Never Fully Obey Humanity
🤖 AI Models · Content type: Blog
medium.com · 3 hours ago
Germany's National Security Council greenlights an AI Safety Institute modeled after the UK's AISI
💻 AI Coding
the-decoder.com · 14 hours ago
My Oslo Freedom Forum Keynote: Authoritarians and AI
💻 AI Coding · Content type: Blog
redpacket.substack.com · 2 days ago · Substack
Anthropic Scared, Calls for Global Freeze on AI Advances
💻 AI Coding
futurism.com · 5 days ago
xAI fired an engineer who raised alarms about Grok safety, new lawsuit claims
💻 AI Coding
techcrunch.com · 3 hours ago
Advanced AI Safety Addendum
💻 AI Coding
cloud.google.com · 1 day ago · Hacker News
What the Claude Is Going on with Anthropic?
🔄 Agentic Workflows
whytryai.com · 6 days ago
Anthropic releases a version of its vaunted Mythos model to developers
💻 AI Coding
fastcompany.com · 1 day ago
Musk's xAI accused of illegally firing engineer who raised safety concerns
💻 AI Coding · Content type: News
ca.finance.yahoo.com · 8 hours ago
Claude Fable 5 and new AI safety fables
🧠 LLMs · Content type: News
interconnects.ai · 1 day ago · Hacker News
Anthropic calls for pause of global AI development
💻 AI Coding
techxplore.com · 5 days ago