You think you are in control?
lesswrong.com·13h
Self-Hosting
My YC Pitch
lesswrong.com·1d
Open Source
Human Values ≠ Goodness
lesswrong.com·1d
Digital Gardens
Parleying with the Principled
lesswrong.com·6h
Homomorphic Encryption
Build the life you actually want
lesswrong.com·2h
Digital Gardens
Sonnet 4.5's eval gaming seriously undermines alignment evals, and this seems caused by training on alignment evals
lesswrong.com·4d
AI Interpretability
High-Resistance Systems to Change: Can a Political Strategy Apply to Personal Change?
lesswrong.com·11h
Incremental Computation
2025 Unofficial LW Community Census, Request for Comments
lesswrong.com·2d
Digital Gardens
LLM Hallucinations: An Internal Tug of War
lesswrong.com·5d
AI Interpretability
Ohio House Bill 469
lesswrong.com·1d
Embedded Systems
Debugging Despair ~> A bet about Satisfaction and Values
lesswrong.com·3d
Incremental Computation
Spending Less by Doing More
lesswrong.com·1d
Reinforcement Learning
When Will AI Transform the Economy?
lesswrong.com·6d
AI Interpretability
Summary and Comments on Anthropic's Pilot Sabotage Risk Report
lesswrong.com·4d
Observability
Asking Paul Fussell for Writing Advice
lesswrong.com·3d
Category Theory
Ink without haven
lesswrong.com·3d
Writing
Why do AI models use so many em-dashes?
Writing
Secretly Loyal AIs: Threat Vectors and Mitigation Strategies
lesswrong.com·3d
Homomorphic Encryption
Strategy-Stealing Argument Against AI Dealmaking
lesswrong.com·3d
Reinforcement Learning
Agentic AI and Security
MLOps