🛡️ AI Safety - jinkai_lau · Scour

RiskNet: A large-scale dataset of AI risk incidents from news with alignment and multi-dimensional annotations

🔬AI Research Academic

Mechanistic Interpretability: The Key to Trusting Agentic AI

🔎AI Interpretability Discussion

bradenkelley.com·

ML4Good Summer 2026 Bootcamps - Applications Open!

🤖AI Engineering

lesswrong.com·

DTEX adds AI Risk Management to track how agents and employees use AI

🤖AI Engineering

siliconangle.com·

Preprint warns of catastrophic AI risks if no action is taken within five years

🤖AI Engineering

news.uq.edu.au··Hacker News

Less-relevant results

Anthropic May Be Reconsidering the Pace of AI

🤖AI Engineering

thinkingabout.ai·

[Recorded talk] "AI Alignment Versus AI Ethical Treatment: 10 Challenges"

🤖AI Engineering Blog

meditationsondigitalminds.substack.com··Substack

Apollo’s Shutterfly Sweetens Debt as Investors Weigh AI Risk

🤖AI Engineering News

·

From oversight to coercion: How authoritarian governments are twisting AI safety to get tech companies to fall in line

🤖AI Engineering

theconversation.com·

ToxicSkills Revisit: Loch Ness Levels of Mythical AI Risk

🌐Open Source

flyingpenguin.com·

Anthropic proposes global development pause to mitigate recursive AI risks

🤖AI Engineering

David Sacks argues AI catastrophe narratives justify government control, while Gary Marcus counters that AI risk is bipartisan

🔬AI Research News

New HSCC guidance confronts AI cyber risk, champions governance | TechTarget

🤖AI Engineering

·

Criti-hyping is the best thing that happened to Big Tech

🚀Emerging Tech

reveriesofahuman.com·

The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably

🔎AI Interpretability

lesswrong.com·

Did Microsoft take the AI risks so Apple didn’t have to? Cast your vote.

🤖AI Engineering

pureinfotech.com·

Veeam Research: Who is Responsible for Rogue AI Behaviour?

🤖AI Engineering

aimagazine.com·

Who Elected Anthropic?

🔎AI Interpretability Blog

vizierprime.substack.com··Substack

International Workshop on Risk and Insurance, 서울, June 2026

🔤Type Systems

freakonometrics.hypotheses.org·

Africa Faces AI Risks as Regulation Lags Behind Innovation

🤖AI Engineering Video

Log in to enable infinite scrolling