Weak-To-Strong Generalization
lesswrong.com·1d
∘Category Theory
Flag this post
Solving a problem with mindware
lesswrong.com·9h
🗃️Zettelkasten
Flag this post
To improve Rationality, create Situations
lesswrong.com·8h
🗃️Zettelkasten
Flag this post
A toy model of corrigibility
lesswrong.com·1d
⚡Incremental Computation
Flag this post
How Powerful AI Gets Cheap
lesswrong.com·7h
🔢Homomorphic Encryption
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·3d
🎯Reinforcement Learning
Flag this post
Parleying with the Principled
lesswrong.com·20m
🔢Homomorphic Encryption
Flag this post
I Wondered Why I Procrastinate Even On Things I Am "Passionate" About
lesswrong.com·1d
🌿Digital Gardens
Flag this post
Just complaining about LLM sycophancy (filler episode)
lesswrong.com·4h
✍Writing
Flag this post
Spending Less by Doing More
lesswrong.com·19h
🎯Reinforcement Learning
Flag this post
Lack of Social Grace is a Lack of Skill
lesswrong.com·20h
🗃️Zettelkasten
Flag this post
Human Values ≠ Goodness
lesswrong.com·1d
🌿Digital Gardens
Flag this post
Loading...Loading more...