Temporarily Losing My Ego
lesswrong.com·6d
Self-Hosting
Anthropic's Pilot Sabotage Risk Report
lesswrong.com·3d
Observability
Decision theory when you can't make decisions
lesswrong.com·1d
Reinforcement Learning
Seattle Secular Solstice 2025 – Dec 20th
lesswrong.com·2d
Digital Gardens
Brainstorming Food on the Cheap+Healthy+Convenient+Edible Frontier
lesswrong.com·6d
Digital Gardens
Emergent Introspective Awareness in Large Language Models
lesswrong.com·4d
Zettelkasten
Strategy-Stealing Argument Against AI Dealmaking
lesswrong.com·2d
Reinforcement Learning
Transactional method for non-transactional relationship: Relationship as a Common-pool Resource problem
lesswrong.com·6d
Graph Databases
OpenAI Moves To Complete Potentially The Largest Theft In Human History
lesswrong.com·3d
Open Source
No title
lesswrong.com·6d
AI Interpretability
Supervillain Monologues Are Unrealistic
lesswrong.com·2d
Writing
Why Civilizations Are Unstable (And What This Means for AI Alignment)
lesswrong.com·5d
AI Interpretability
Workshop on Post-AGI Economics, Culture, and Governance
lesswrong.com·5d
Open Source
Q2 AI Benchmark Results: Pros Maintain Clear Lead
lesswrong.com·6d
AI Interpretability
Summary and Comments on Anthropic's Pilot Sabotage Risk Report
lesswrong.com·3d
Observability
Verified Relational Alignment: A Framework for Robust AI Safety Through Collaborative Trust
lesswrong.com·6d
AI Interpretability
On The Conservation of Rights
lesswrong.com·4d
Homomorphic Encryption