Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Safety
🛡️ AI Safety
alignment, RLHF, safety, interpretability
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
300
posts in
7.2
ms
ToxicSkills Revisit: Loch Ness Levels of Mythical
AI
Risk
🧠
LLM Research
flyingpenguin.com
·
2d
2 days ago
Actions for ToxicSkills Revisit: Loch Ness Levels of Mythical AI Risk
Representation-Aware Advantage Estimation: Your
Reward
Model
Provides More Than A Scalar Output
🧠
LLM Research
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for Representation-Aware Advantage Estimation: Your Reward Model Provides More Than A Scalar Output
Anthropic
urges
AI
labs to pause, warns humans
risk
losing control
🤖
Robotics
Content type:
Video
Content type:
News
aljazeera.com
·
5d
5 days ago
Actions for Anthropic urges AI labs to pause, warns humans risk losing control
towards a typology of people who feel really quite strongly about
AI
🔮
Multimodal AI
aphie.xyz
·
5h
5 hours ago
Actions for towards a typology of people who feel really quite strongly about AI
Lawmakers Are Aiming To Regulate
AI-Builds-AI
Before
AI
Gets Entirely Beyond Human Control
🤖
AI Engineering
forbes.com
·
1d
1 day ago
Actions for Lawmakers Are Aiming To Regulate AI-Builds-AI Before AI Gets Entirely Beyond Human Control
ML4Good Summer 2026 Bootcamps - Applications Open!
🧠
LLM Research
lesswrong.com
·
9h
9 hours ago
Actions for ML4Good Summer 2026 Bootcamps - Applications Open!
Anthropic
’s Shocking Warning:
AI
Could Soon Upgrade Itself—Should the World Hit Pause?
🤖
Robotics
Content type:
Video
youtube.com
·
4d
4 days ago
Actions for Anthropic’s Shocking Warning: AI Could Soon Upgrade Itself—Should the World Hit Pause?
The mega-IPO wave led by SpaceX and
Anthropic
has retirees worried about their finances. Their advisors say otherwise.
🤖
Robotics
Content type:
News
businessinsider.com
·
1d
1 day ago
Actions for The mega-IPO wave led by SpaceX and Anthropic has retirees worried about their finances. Their advisors say otherwise.
Anthropic
warns
AI
could soon build itself without human involvement—and urges a global pause on development
🤖
Robotics
Content type:
News
tech.yahoo.com
·
5d
5 days ago
Actions for Anthropic warns AI could soon build itself without human involvement—and urges a global pause on development
If You Think
AI
Companies Are Unethical Now, Wait Until They Go Public
🤖
Robotics
futurism.com
·
1d
1 day ago
Actions for If You Think AI Companies Are Unethical Now, Wait Until They Go Public
Mechanistic
Interpretability
: The Key to Trusting Agentic
AI
🤖
Robotics
Content type:
Discussion
bradenkelley.com
·
4d
4 days ago
Actions for Mechanistic Interpretability: The Key to Trusting Agentic AI
Anthropic
Tries to Revive the “
AI
Pause”
🔮
Multimodal AI
internetgovernance.org
·
3d
3 days ago
Actions for Anthropic Tries to Revive the “AI Pause”
Alignment
Defends LLMs from Property Inference Attacks
🧠
LLM Research
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for Alignment Defends LLMs from Property Inference Attacks
Germany to create
AI
safety
agency
🔮
Multimodal AI
techxplore.com
·
1d
1 day ago
Actions for Germany to create AI safety agency
Anthropic
urges global freeze on
AI
as it warns of losing control
🤖
Robotics
Content type:
News
smh.com.au
·
5d
5 days ago
·
r/singularity
Actions for Anthropic urges global freeze on AI as it warns of losing control
The Stoic Path to Actual
AI
Safety
: Three Practical Steps for Industry and Individuals
🤖
AI Engineering
oodaloop.com
·
2d
2 days ago
Actions for The Stoic Path to Actual AI Safety: Three Practical Steps for Industry and Individuals
Anthropic
's Latest PR Triumph
🤖
AI Engineering
Content type:
News
nonzero.org
·
4d
4 days ago
Actions for Anthropic's Latest PR Triumph
Sequent: scale and automation for higher confidence in
alignment
🧠
LLM Research
lesswrong.com
·
4h
4 hours ago
Actions for Sequent: scale and automation for higher confidence in alignment
The Best Politician In A Generation
🤖
Robotics
Content type:
News
Content type:
Blog
benthams.substack.com
·
1d
1 day ago
·
Substack
Actions for The Best Politician In A Generation
Weekly news roundup:
Anthropic
goes public, Nvidia 'superchip,' and SpaceX historic IPO | TechTarget
🤖
AI Engineering
techtarget.com
·
5d
5 days ago
Actions for Weekly news roundup: Anthropic goes public, Nvidia 'superchip,' and SpaceX historic IPO | TechTarget
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help