AI Safety

AI reliability, AI alignment, safe AI, robust AI systems

Feeds to Scour
SubscribedAll
Scoured 269 posts in 5.3 ms

Anthropic urges a way to pause AI development as risks grow with the tech advances

 💻AI Coding
the-journal.com·

ML4Good Summer 2026 Bootcamps - Applications Open!

 💻AI Coding
lesswrong.com·

A Unifying Lens on Reward Uncertainty in RLHF

 🧠LLMs  Content type: Academic
arxiv.org·

towards a typology of people who feel really quite strongly about AI

 🤖AI Models
aphie.xyz·

From oversight to coercion: How authoritarian governments are twisting AI safety to get tech companies to fall in line

 🤖AI Models
theconversation.com·

Paving the way for agents in biology

 🤖LLM Agents
anthropic.com··Hacker News

Anthropic urges AI labs to pause, warns humans risk losing control

 💻AI Coding  Content type: Video  Content type: News
aljazeera.com·

New framework for auditing machine unlearning

 TLA+  Content type: Blog
research.google·

If You Think AI Companies Are Unethical Now, Wait Until They Go Public

 💻AI Coding
futurism.com·

Anthropic’s Shocking Warning: AI Could Soon Upgrade Itself—Should the World Hit Pause?

 💻AI Coding  Content type: Video
youtube.com·

On AI Safety Concerns, Mark Carney Is Out of Step with Canadians

 💻AI Coding  Content type: News
thetyee.ca
·

Anthropic warns AI could soon build itself without human involvement—and urges a global pause on development

 💻AI Coding  Content type: News
tech.yahoo.com·

The Stoic Path to Actual AI Safety: Three Practical Steps for Industry and Individuals

 💻AI Coding
oodaloop.com·

new mantra just dropped

 💻AI Coding
aphie.xyz·

Anthropic urges global freeze on AI as it warns of losing control

 💻AI Coding  Content type: News
smh.com.au
··r/singularity

Assessing the Polyglot Chatbot: Multilingual Safety in AI Systems

 📝Prompt Engineering
cdt.org·

Anthropic's Latest PR Triumph

 💻AI Coding  Content type: News
nonzero.org·

Criti-hyping is the best thing that happened to Big Tech

 🌐Distributed Systems

I Started an AI Safety Research Org and Think These 7 Things Matter

 💻AI Coding
lesswrong.com·

The Best Politician In A Generation

 📏Model Evaluation  Content type: News  Content type: Blog
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help