AI Safety

Feeds to Scour
SubscribedAll
Scoured 142 posts in 9.5 ms

RiskNet: A large-scale dataset of AI risk incidents from news with alignment and multi-dimensional annotations

 🔬AI Research  Content type: Academic
arxiv.org·

Mechanistic Interpretability: The Key to Trusting Agentic AI

 🔎AI Interpretability  Content type: Discussion
bradenkelley.com·

ML4Good Summer 2026 Bootcamps - Applications Open!

 🤖AI Engineering
lesswrong.com·

DTEX adds AI Risk Management to track how agents and employees use AI

 🤖AI Engineering
siliconangle.com·

Preprint warns of catastrophic AI risks if no action is taken within five years

 🤖AI Engineering
Less-relevant results

Anthropic May Be Reconsidering the Pace of AI

 🤖AI Engineering
thinkingabout.ai·

[Recorded talk] "AI Alignment Versus AI Ethical Treatment: 10 Challenges"

 🤖AI Engineering  Content type: Blog

Apollo’s Shutterfly Sweetens Debt as Investors Weigh AI Risk

 🤖AI Engineering  Content type: News
bloomberg.com
·

From oversight to coercion: How authoritarian governments are twisting AI safety to get tech companies to fall in line

 🤖AI Engineering
theconversation.com·

ToxicSkills Revisit: Loch Ness Levels of Mythical AI Risk

 🌐Open Source
flyingpenguin.com·

Anthropic proposes global development pause to mitigate recursive AI risks

 🤖AI Engineering
4sysops.com·

David Sacks argues AI catastrophe narratives justify government control, while Gary Marcus counters that AI risk is bipartisan

 🔬AI Research  Content type: News
digg.com·

New HSCC guidance confronts AI cyber risk, champions governance | TechTarget

 🤖AI Engineering
techtarget.com
·

Criti-hyping is the best thing that happened to Big Tech

 🚀Emerging Tech

The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably

 🔎AI Interpretability
lesswrong.com·

Did Microsoft take the AI risks so Apple didn’t have to? Cast your vote.

 🤖AI Engineering
pureinfotech.com·

Veeam Research: Who is Responsible for Rogue AI Behaviour?

 🤖AI Engineering
aimagazine.com·

Who Elected Anthropic?

 🔎AI Interpretability  Content type: Blog

International Workshop on Risk and Insurance, 서울, June 2026

 🔤Type Systems

Africa Faces AI Risks as Regulation Lags Behind Innovation

 🤖AI Engineering  Content type: Video
youtube.com·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help