Summary and Comments on Anthropic's Pilot Sabotage Risk Report
lesswrong.comยท4d
๐Ÿ‘๏ธObservability
Flag this post
Body Time and Daylight Savings Apologetics
lesswrong.comยท1d
๐Ÿ”ขHomomorphic Encryption
Flag this post
25 Que
lesswrong.comยท2d
๐Ÿ—ƒ๏ธZettelkasten
Flag this post
Freewriting in my head, and overcoming the โ€œtwinge of startingโ€
lesswrong.comยท3d
๐Ÿ—ƒ๏ธZettelkasten
Flag this post
Decision theory when you can't make decisions
lesswrong.comยท2d
๐ŸŽฏReinforcement Learning
Flag this post
Brainstorming 25 Questions I Am Interested In
lesswrong.comยท2d
๐Ÿ—ƒ๏ธZettelkasten
Flag this post
2025 Unofficial LW Community Census, Request for Comments
lesswrong.comยท2d
๐ŸŒฟDigital Gardens
Flag this post
Why Civilizations Are Unstable (And What This Means for AI Alignment)
lesswrong.comยท6d
๐Ÿ”AI Interpretability
Flag this post
Doom from a Solution to the Alignment Problem
lesswrong.comยท2d
โšกIncremental Computation
Flag this post
Reason About Intelligence, Not AI
lesswrong.comยท1d
๐Ÿ”AI Interpretability
Flag this post
Me consuming five different forms of media at once to minimize the chance of a thought occurring
lesswrong.comยท2d
๐ŸŒฟDigital Gardens
Flag this post
There's some chance oral herpes is pretty bad for you?
lesswrong.comยท1d
๐Ÿฆ€Rust
Flag this post
Secretly Loyal AIs: Threat Vectors and Mitigation Strategies
lesswrong.comยท3d
๐Ÿ”ขHomomorphic Encryption
Flag this post
Human Values โ‰  Goodness
lesswrong.comยท1d
๐ŸŒฟDigital Gardens
Flag this post
Publishing academic papers on transformative AI is a nightmare
lesswrong.comยท1dยท
Discuss: Hacker News
๐Ÿ”AI Interpretability
Flag this post