A toy model of corrigibility
lesswrong.comยท4h
โกIncremental Computation
Flag this post
Agentic AI and Security
๐MLOps
Flag this post
My YC Pitch
lesswrong.comยท12h
๐Open Source
Flag this post
Me consuming five different forms of media at once to minimize the chance of a thought occurring
lesswrong.comยท14h
๐ฟDigital Gardens
Flag this post
Weak-To-Strong Generalization
lesswrong.comยท20h
โCategory Theory
Flag this post
Model welfare and open source
lesswrong.comยท20h
โกIncremental Computation
Flag this post
RSS feeds discovery strategies
๐กRSS
Flag this post
Reason About Intelligence, Not AI
lesswrong.comยท3h
๐AI Interpretability
Flag this post
I Wondered Why I Procrastinate Even On Things I Am "Passionate" About
lesswrong.comยท7h
๐ฟDigital Gardens
Flag this post
Doom from a Solution to the Alignment Problem
lesswrong.comยท6h
โกIncremental Computation
Flag this post
Reflections on 4 years of meta-honesty
lesswrong.comยท17h
ฮปFunctional Programming
Flag this post
Human Values โ Goodness
lesswrong.comยท3h
๐ฟDigital Gardens
Flag this post
Asking Paul Fussell for Writing Advice
lesswrong.comยท1d
โCategory Theory
Flag this post
Economics and Transformative AI (by Tom Cunningham)
lesswrong.comยท1d
๐AI Interpretability
Flag this post
Halfhaven Digest #3
lesswrong.comยท2d
๐กRSS
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.comยท2d
๐ฏReinforcement Learning
Flag this post
Vaccination against ASI
lesswrong.comยท1d
๐AI Interpretability
Flag this post
Seattle Secular Solstice 2025 โ Dec 20th
lesswrong.comยท1d
๐ฟDigital Gardens
Flag this post
Loading...Loading more...