AI Safety

AI alignment, safety research, interpretability, superintelligence

Feeds to Scour
SubscribedAll
Scoured 138 posts in 6.9 ms

Endpoint Security Built for Outcomes, Not Noise

 💳Fintech
arcticwolf.com·

Anthropic reports its engineers now ship eight times more code per quarter, prompting debate over how to measure AI productivity

 Software Perf
digg.com·

Interactions Between Crosscoder Features: A Compact Proofs Perspective

 💬LLMs  Content type: Academic
arxiv.org·

AI drug discovery leaders warn U.S. health funding cuts risk falling behind global rivals

 🤖ai  Content type: News
fortune.com
·

The crucial human component in computing and AI

 🤖ai  Content type: Academic
news.mit.edu·

Trump and Xi alone may have the chance to stop AI from spiralling out of control

 🤖ai  Content type: News
smh.com.au
·
Less-relevant results

Nine Things About Claude Mythos 5 That Matter If You’re Not an Enterprise Customer

 🤖ai  Content type: News
thealgorithmicbridge.com·

Iliad is Hiring

 🔓Open Source AI
lesswrong.com·

FoldSAE: Learning to Steer Protein Folding Through Sparse Representations

 ⚙️ML Engineering  Content type: Academic
arxiv.org·

Mira Murati Unveils Her Startup’s A.I. Model in First Interview Since OpenAI

 🔓Open Source AI  Content type: News
observer.com·

Meta Stock Turns $1,000 Into $5,300 in 10 Years But Did It Beat The S&P 500?

 🔓Open Source AI
247wallst.com·

Quote of the day by Anthropic CEO Dario Amodei: "Humanity is about to be handed almost unimaginable power, and it is deeply unclear whether [we] possess the maturity to wield it" — warnings on the looming threat of beyond-human AI

 🧠AI Research
techradar.com
·

FT: ChatGPT getting a ‘superapp’ revamp before OpenAI hits IPO

 🔓Open Source AI
siliconrepublic.com·

Ablation-Reversible Heads Don't Transfer: A Stress Test for Mechanistic Role Claims in Transformers

 🤖ai  Content type: Academic
arxiv.org·

Meta Keeps Delaying the Release of Its New AI Model to Developers

 🤖ai
meta.slashdot.org·

umair-tareen/philosopher-council: An eleven-philosopher LLM council - ask it questions or point it at AI-research trends. Claude-powered deliberation through the four classical branches of philosophy. Methodology, not metaphysics.

 🤖ai  Content type: Code
github.com··r/SideProject

Import AI 460: Reward hacking society, RSI data from Anthropic; and RL-based quadcopter racing

 🤖ai  Content type: News  Content type: Blog

Director Nick Holt on Exploring AI’s Origins in Tribeca Doc ‘AI: Probably Nothing to Worry About’: Film Is About ‘the Creation of a Sort of Species’ (EXCLUSIVE)

 🤖ai  Content type: News
variety.com·

Op Ed: Consultant Tony O’Connor On The Agentic Trojan Horse

 🤖ai
thecompanydime.com·

Beyond Safety Through Filtering: Toward Responsible Training on Human Distress

 ⚙️ML Engineering  Content type: Blog
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help