Summary and Comments on Anthropic's Pilot Sabotage Risk Report
lesswrong.comยท4d
๐Ÿ‘๏ธObservability
Flag this post
New 80,000 Hours problem profile on the risks of power-seeking AI
lesswrong.comยท6d
๐Ÿ”AI Interpretability
Flag this post
Why I Transitioned: A Case Study
lesswrong.comยท2d
โˆ˜Category Theory
Flag this post
Upcoming Workshop on Post-AGI Economics, Culture, and Governance
lesswrong.comยท6d
๐ŸŒOpen Source
Flag this post
Please Do Not Sell B30A Chips to China
lesswrong.comยท5d
๐Ÿ”ŒEmbedded Systems
Flag this post
On The Conservation of Rights
lesswrong.comยท4d
๐Ÿ”ขHomomorphic Encryption
Flag this post
Interview on the Hengshui Model High School
lesswrong.comยท5d
โœWriting
Flag this post
Anthropic's Pilot Sabotage Risk Report
lesswrong.comยท4d
๐Ÿ‘๏ธObservability
Flag this post
Genius is Not About Genius
lesswrong.comยท5d
โˆ˜Category Theory
Flag this post
Mottes and Baileys in AI discourse
lesswrong.comยท6d
๐Ÿ—ƒ๏ธZettelkasten
Flag this post
Seattle Secular Solstice 2025 โ€“ Dec 20th
lesswrong.comยท2d
๐ŸŒฟDigital Gardens
Flag this post
Emergent Introspective Awareness in Large Language Models
lesswrong.comยท5d
๐Ÿ—ƒ๏ธZettelkasten
Flag this post
Temporarily Losing My Ego
lesswrong.comยท6d
๐Ÿ Self-Hosting
Flag this post
Supervillain Monologues Are Unrealistic
lesswrong.comยท3d
โœWriting
Flag this post
The Memetics of AI Successionism
lesswrong.comยท6d
๐Ÿ”AI Interpretability
Flag this post
Why Would we get Inner Misalignment by Default?
lesswrong.comยท6d
๐Ÿ”AI Interpretability
Flag this post
Workshop on Post-AGI Economics, Culture, and Governance
lesswrong.comยท6d
๐ŸŒOpen Source
Flag this post