Decision theory when you can't make decisions
lesswrong.comยท2d
๐ŸŽฏReinforcement Learning
Flag this post
Secretly Loyal AIs: Threat Vectors and Mitigation Strategies
lesswrong.comยท3d
๐Ÿ”ขHomomorphic Encryption
Flag this post
Vaccination against ASI
lesswrong.comยท2d
๐Ÿ“ฎMessage Queues
Flag this post
Ink without haven
lesswrong.comยท3d
โœWriting
Flag this post
LLM Hallucinations: An Internal Tug of War
lesswrong.comยท5d
๐Ÿ”AI Interpretability
Flag this post
Centralization begets stagnation
lesswrong.comยท4d
๐ŸŒDistributed Systems
Flag this post
OpenAI Moves To Complete Potentially The Largest Theft In Human History
lesswrong.comยท3d
๐ŸŒOpen Source
Flag this post
Why Is Printing So Bad?
lesswrong.comยท2d
โœWriting
Flag this post
Seattle Secular Solstice 2025 โ€“ Dec 20th
lesswrong.comยท2d
๐ŸŒฟDigital Gardens
Flag this post
Transactional method for non-transactional relationship: Relationship as a Common-pool Resource problem
lesswrong.comยท6d
๐Ÿ•ธ๏ธGraph Databases
Flag this post
When Will AI Transform the Economy?
lesswrong.comยท6d
๐Ÿ”AI Interpretability
Flag this post
Why Civilizations Are Unstable (And What This Means for AI Alignment)
lesswrong.comยท5d
๐Ÿ”AI Interpretability
Flag this post
Interview on the Hengshui Model High School
lesswrong.comยท4d
โœWriting
Flag this post
New 80,000 Hours problem profile on the risks of power-seeking AI
lesswrong.comยท6d
๐Ÿ”AI Interpretability
Flag this post
Summary and Comments on Anthropic's Pilot Sabotage Risk Report
lesswrong.comยท4d
๐Ÿ‘๏ธObservability
Flag this post