🛡️ AI Safety - daemsc · Scour

Who Elected Anthropic?

🤖AI Engineering Blog

vizierprime.substack.com··Substack

RiskNet: A large-scale dataset of AI risk incidents from news with alignment and multi-dimensional annotations

🤖AI Engineering Academic

Three types of model organism

🎯Reinforcement Learning

lesswrong.com·

Anthropic urges US to require safety tests for most capable AI models

🔮Multimodal AI

channelnewsasia.com·

Claude Fable 5: Anthropic releases a 'safe' version of Claude Mythos

🤖AI Engineering News

AI giant says its own models could soon improve themselves — and now it wants a global pause

🤖AI Engineering

thecooldown.com·

Anthropic urges ‘temporary pause’ on AI development to discuss risks

🤖Robotics News

theguardian.com··Hacker News, Hacker News

Anthropic releases Mythos-derived model with cyber guardrails

🤖AI Engineering

metacurity.com·

AI, at a Crossroads

🤖AI Engineering News Blog

edgyoptimist.substack.com··Substack

Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits capabilities for AI researchers and developers

🤖AI Engineering News

·

Claude Fable 5 and new AI safety fables

🧠LLM Research News

interconnects.ai··Hacker News

Anthropic proposes global development pause to mitigate recursive AI risks

Mythos and the Adolescence of AI Policy

🤖AI Engineering News

luizasnewsletter.com·

Anthropic's Model Naming, Extrapolated

🤖AI Engineering

samwilkinson.io··Hacker News

Anthropic releases a version of its vaunted Mythos model to developers

🤖AI Engineering

fastcompany.com·

Germany's National Security Council greenights an AI Safety Institute modeled after the UK's AISI

🤖AI Engineering

the-decoder.com

·

Anthropic Scared, Calls for Global Freeze on AI Advances

Advanced AI Safety Addendum

🤖AI Engineering

cloud.google.com··Hacker News

Musk's xAI accused of illegally firing engineer who raised safety concerns

🤖AI Engineering News

ca.finance.yahoo.com·

What the Claude Is Going on with Anthropic?

🧠LLM Research

Log in to enable infinite scrolling