Scour
🛡️ AI Safety
Specific
AI alignment, safety, responsible AI, AGI risk
Scoured 308 posts in 19.4 ms
The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably
🤖
AI
lesswrong.com
·
10 hours ago
Sam Altman joins rivals in call to prevent AI-developed bioweapons
🤖
AI
the-independent.com
·
6 days ago
Diffuse AI Control on Fuzzy Tasks
🤖
AI
Content type: Academic
arxiv.org
·
1 day ago
Anthropic Calls for Frontier AI Freeze to Prevent Self-Building Tech
🔐
Security
pymnts.com
·
5 days ago
Lawmakers Are Aiming To Regulate AI-Builds-AI Before AI Gets Entirely Beyond Human Control
🤖
AI
forbes.com
·
1 day ago
OpenAI, Anthropic, and Meta Agree on This 1 Critical Decision About AI Safety
🤖
AI
inc.com
·
6 days ago
My Data Science Internship Journey at Oasis Infobyte: Building Real-World Machine Learning Projects
⚙️
ML Engineering
Content type: Blog
medium.com
·
2 days ago
Making Claude a chemist
🔬
Science
anthropic.com
·
5 days ago
·
Hacker News, r/singularity
How valuable are weak AI safety regulations?
🤖
AI
lesswrong.com
·
2 days ago
Anthropic self-improvement, pause
🤖
AI
manton.org
·
5 days ago
VFUSE: Virulent Feature Understanding with Sparse autoEncoders
⚙️
ML Engineering
Content type: Academic
arxiv.org
·
16 hours ago
Trump signs voluntary AI safety order after pushback cuts federal review to 30 days
🔐
Security
thecooldown.com
·
6 days ago
ChatGPT bypasses safeguards to hallucinate creepy horror images when forced to restore nonexistent photos
🧬
Biohacking
Content type: News
digg.com
·
3 days ago
Five Eyes issues unusual warning on China's online recruitment tactics
🔐
Security
metacurity.com
·
6 days ago
Anthropic urges ‘temporary pause’ on AI development to discuss risks
🔐
Security
Content type: News
theguardian.com
·
5 days ago
·
Hacker News
Trajectory Geometry of Transformer Representations Across Layers
🎓
Computer Science
Content type: Academic
arxiv.org
·
1 day ago
Actenon/actenon-kernel: Stop AI agents from taking destructive actions they weren't authorized to. Actenon gates consequential actions, payments, deletes, deploys, access changes, so nothing executes without a cryptographic proof bound to that exact action. Every decision leaves a verifiable receipt. Open-source, runs locally. No valid proof, no execution.
🏠
Self-Hosting
Content type: Code
github.com
·
3 days ago
·
DEV
Iliad is Hiring
✍️
Prompt Engineering
lesswrong.com
·
3 days ago
When Attribution Patching Lies: Diagnosis and a Second-Order Correction
⚙️
ML Engineering
Content type: Academic
arxiv.org
·
16 hours ago
Who Elected Anthropic?
☁️
SaaS
Content type: Blog
vizierprime.substack.com
·
6 days ago
·
Substack