Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Safety
🛡️ AI Safety
AI safety, alignment, AI risk, existential risk
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
185
posts in
6.0
ms
After backlash, Anthropic says its
AI
will now tell users when their request is being rejected or downgraded for national security concerns
🌐
AGI
Content type:
News
fortune.com
·
7h
7 hours ago
Actions for After backlash, Anthropic says its AI will now tell users when their request is being rejected or downgraded for national security concerns
New framework for auditing machine unlearning
💬
LLMs
Content type:
Blog
research.google
·
1d
1 day ago
Actions for New framework for auditing machine unlearning
Learnings from starting an
AI
safety
research
team
🧠
AI Research
lesswrong.com
·
6d
6 days ago
Actions for Learnings from starting an AI safety research team
new mantra just dropped
⚙️
ROS
aphie.xyz
·
1d
1 day ago
Actions for new mantra just dropped
Prompt injection still drives most agentic
AI
security failures in production
⚙️
ROS
helpnetsecurity.com
·
19h
19 hours ago
Actions for Prompt injection still drives most agentic AI security failures in production
The crucial human component in computing and
AI
🌐
AGI
Content type:
Academic
news.mit.edu
·
6d
6 days ago
Actions for The crucial human component in computing and AI
Anthropic pledges $200 million to
research
AI
's economic impact as CEO suggests job loss solutions
🌐
AGI
techxplore.com
·
15h
15 hours ago
Actions for Anthropic pledges $200 million to research AI's economic impact as CEO suggests job loss solutions
Anthropic urges ‘temporary pause’ on
AI
development to discuss
risks
🌐
AGI
Content type:
News
theguardian.com
·
6d
6 days ago
·
Hacker News
,
Hacker News
Actions for Anthropic urges ‘temporary pause’ on AI development to discuss risks
Abdul El-Sayed calls for public ownership of
AI
, citing
risk
of ‘human demise’
🌐
AGI
Content type:
News
bridgemi.com
·
2d
2 days ago
Actions for Abdul El-Sayed calls for public ownership of AI, citing risk of ‘human demise’
Anthropic Wants an
AI
Pause Button in 2026
🌐
AGI
memeburn.com
·
1d
1 day ago
Actions for Anthropic Wants an AI Pause Button in 2026
ChatGPT bypasses safeguards to hallucinate creepy horror images when forced to restore nonexistent photos
🏳️🌈
LGBT Tech
Content type:
News
digg.com
·
5d
5 days ago
Actions for ChatGPT bypasses safeguards to hallucinate creepy horror images when forced to restore nonexistent photos
AI
CEOs Warn Congress Over Bioweapon
Risks
🌐
AGI
memeburn.com
·
2d
2 days ago
Actions for AI CEOs Warn Congress Over Bioweapon Risks
Anthropic rankles users with
safety-first
Fable release
🌐
AGI
Content type:
News
Content type:
Reference
nbcnews.com
·
2h
2 hours ago
Actions for Anthropic rankles users with safety-first Fable release
Elon Musk endorses immigrant deportations before SpaceX IPO
🏳️🌈
LGBT Tech
Content type:
News
mashable.com
·
2h
2 hours ago
Actions for Elon Musk endorses immigrant deportations before SpaceX IPO
Anthropic's Model Naming, Extrapolated
🌐
AGI
samwilkinson.io
·
2d
2 days ago
·
Hacker News
Actions for Anthropic's Model Naming, Extrapolated
Actenon/actenon-kernel: Stop
AI
agents from taking destructive actions they weren't authorized to. Actenon gates consequential actions, payments, deletes, deploys, access changes, so nothing executes without a cryptographic proof bound to that exact action. Every decision leaves a verifiable receipt. Open-source, runs locally. No valid proof, no execution.
⚙️
ROS
Content type:
Code
github.com
·
4d
4 days ago
·
DEV
Actions for Actenon/actenon-kernel: Stop AI agents from taking destructive actions they weren't authorized to. Actenon gates consequential actions, payments, deletes, deploys, access changes, so nothing executes without a cryptographic proof bound to that exact action. Every decision leaves a verifiable receipt. Open-source, runs locally. No valid proof, no execution.
AI
#172: The First Fable
🌐
AGI
Content type:
Blog
thezvi.wordpress.com
·
5h
5 hours ago
Actions for AI #172: The First Fable
Diffuse
AI
Control on Fuzzy Tasks
🧠
AI Research
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Diffuse AI Control on Fuzzy Tasks
Grieving mother alleges ChatGPT failed to protect daughter in mental health crisis
🏳️🌈
LGBT Tech
Content type:
News
the-independent.com
·
10h
10 hours ago
Actions for Grieving mother alleges ChatGPT failed to protect daughter in mental health crisis
Anthropic calls for global
AI
slowdown, says systems may outpace human control
🌐
AGI
Content type:
News
france24.com
·
6d
6 days ago
Actions for Anthropic calls for global AI slowdown, says systems may outpace human control
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help