New 80,000 Hours problem profile on the risks of power-seeking AI
lesswrong.com · 6d
AI Interpretability
Why Would we get Inner Misalignment by Default?
lesswrong.com · 6d
AI Interpretability
Agentic AI and Security
martinfowler.com · 6d
MLOps
On The Conservation of Rights
lesswrong.com · 4d
Homomorphic Encryption
Supervillain Monologues Are Unrealistic
lesswrong.com · 3d
Writing
Ilya Sutskever Deposition Transcript
lesswrong.com · 1d
Writing
AI Doomers Should Raise Hell
lesswrong.com · 5d
AI Interpretability
Seattle Secular Solstice 2025 – Dec 20th
lesswrong.com · 2d
Digital Gardens
Anthropic's Pilot Sabotage Risk Report
lesswrong.com · 4d
Observability
A Very Simple Model of AI Dealmaking
lesswrong.com · 6d
AI Interpretability
An Opinionated Guide to Privacy Despite Authoritarianism
lesswrong.com · 5d
Discuss: r/privacy
Homomorphic Encryption
Interview on the Hengshui Model High School
lesswrong.com · 5d
Writing
The Memetics of AI Successionism
lesswrong.com · 6d
AI Interpretability
Mottes and Baileys in AI discourse
lesswrong.com · 6d
Zettelkasten
OpenAI Moves To Complete Potentially The Largest Theft In Human History
lesswrong.com · 3d
Open Source
Temporarily Losing My Ego
lesswrong.com · 6d
Self-Hosting
Unsureism: The Rational Approach to Religious Uncertainty
lesswrong.com · 5d
Category Theory
Genius is Not About Genius
lesswrong.com · 5d
Category Theory