stevekinney.com

Some Thoughts on AI Safety

Covers 9 stories including Goodhart's Law. Discussed on Hacker News.

en.wikipedia.org

Goodhart's Law

Discussed on Hacker News

Machines of Loving Grace

Discussed on Hacker News

en.wikipedia.org

Seeing Like a State

Discussed on Hacker News

The Urgency of Interpretability

Discussed on Hacker News and DEV

anthropic.com

Constitutional AI: Harmlessness from AI Feedback (https://www.anthropic.com/research/constitutional-ai-harmlessness-from-ai-feedback)

anthropic.com

Core Views on AI Safety: When, Why, What, and How

Discussed on Hacker News

en.wikipedia.org

Streetlight Effect

Discussed on Hacker News

Beyond 'Is It Intelligent?': A 5-Layer Framework for Understanding What LLMs Actually Do

Discussed on DEV

anthropic.com

Anthropic's Responsible Scaling Policy