Fragments Nov 3
martinfowler.comยท45m
Weak-To-Strong Generalization
lesswrong.comยท1d
โCategory Theory
Flag this post
To improve Rationality, create Situations
lesswrong.comยท9h
๐๏ธZettelkasten
Flag this post
Solving a problem with mindware
lesswrong.comยท10h
๐๏ธZettelkasten
Flag this post
A toy model of corrigibility
lesswrong.comยท1d
โกIncremental Computation
Flag this post
How Powerful AI Gets Cheap
lesswrong.comยท7h
๐ขHomomorphic Encryption
Flag this post
I Wondered Why I Procrastinate Even On Things I Am "Passionate" About
lesswrong.comยท1d
๐ฟDigital Gardens
Flag this post
Model welfare and open source
lesswrong.comยท1d
โกIncremental Computation
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.comยท3d
๐ฏReinforcement Learning
Flag this post
Evidence on language model consciousness
lesswrong.comยท2d
๐AI Interpretability
Flag this post
Parleying with the Principled
lesswrong.comยท1h
๐ขHomomorphic Encryption
Flag this post
2025 Unofficial LW Community Census, Request for Comments
lesswrong.comยท1d
๐ฟDigital Gardens
Flag this post
Loading...Loading more...