Emergent Introspective Awareness in Large Language Models
lesswrong.comยท5d
๐Ÿ—ƒ๏ธZettelkasten
Flag this post
Interview on the Hengshui Model High School
lesswrong.comยท5d
โœWriting
Flag this post
Summary and Comments on Anthropic's Pilot Sabotage Risk Report
lesswrong.comยท4d
๐Ÿ‘๏ธObservability
Flag this post
Mottes and Baileys in AI discourse
lesswrong.comยท6d
๐Ÿ—ƒ๏ธZettelkasten
Flag this post
Why I Transitioned: A Case Study
lesswrong.comยท2d
โˆ˜Category Theory
Flag this post
AI Doomers Should Raise Hell
lesswrong.comยท5d
๐Ÿ”AI Interpretability
Flag this post
OpenAI Moves To Complete Potentially The Largest Theft In Human History
lesswrong.comยท3d
๐ŸŒOpen Source
Flag this post
A Very Simple Model of AI Dealmaking
lesswrong.comยท6d
๐Ÿ”AI Interpretability
Flag this post
Why Would we get Inner Misalignment by Default?
lesswrong.comยท6d
๐Ÿ”AI Interpretability
Flag this post
Why you shouldn't write a blog post every day for a month
lesswrong.comยท1d
โœWriting
Flag this post
When Will AI Transform the Economy?
lesswrong.comยท6d
๐Ÿ”AI Interpretability
Flag this post
Centralization begets stagnation
lesswrong.comยท4d
๐ŸŒDistributed Systems
Flag this post
Please Do Not Sell B30A Chips to China
lesswrong.comยท5d
๐Ÿ”ŒEmbedded Systems
Flag this post
Supervillain Monologues Are Unrealistic
lesswrong.comยท3d
โœWriting
Flag this post
Temporarily Losing My Ego
lesswrong.comยท6d
๐Ÿ Self-Hosting
Flag this post
New 80,000 Hours problem profile on the risks of power-seeking AI
lesswrong.comยท6d
๐Ÿ”AI Interpretability
Flag this post
Genius is Not About Genius
lesswrong.comยท5d
โˆ˜Category Theory
Flag this post