Weak-To-Strong Generalization
lesswrong.com·20h
λ Functional Programming
I Wondered Why I Procrastinate Even On Things I Am "Passionate" About
lesswrong.com·7h
🌿 Digital Gardens
A toy model of corrigibility
lesswrong.com·4h
⚡ Incremental Computation
Evidence on language model consciousness
lesswrong.com·1d
AI Interpretability
Human Values ≠ Goodness
lesswrong.com·3h
🌿 Digital Gardens
Economics and Transformative AI (by Tom Cunningham)
lesswrong.com·1d
AI Interpretability
Asking Paul Fussell for Writing Advice
lesswrong.com·1d
Writing
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·2d
🎯 Reinforcement Learning
25 Que
lesswrong.com·10h
Zettelkasten
Why I Transitioned: A Case Study
lesswrong.com·23h
Writing
FTL travel and scientific realism
lesswrong.com·16h
Observability
Decision theory when you can't make decisions
lesswrong.com·1d
🎯 Reinforcement Learning
Doom from a Solution to the Alignment Problem
lesswrong.com·6h
⚡ Incremental Computation
My YC Pitch
lesswrong.com·12h
Open Source
Reason About Intelligence, Not AI
lesswrong.com·3h
AI Interpretability
Brainstorming 25 Questions I Am Interested In
lesswrong.com·10h
Zettelkasten
2025 Unofficial LW Community Census, Request for Comments
lesswrong.com·17h
🌿 Digital Gardens