A toy model of corrigibility
lesswrong.com·4h
⚡Incremental Computation
Economics and Transformative AI (by Tom Cunningham)
lesswrong.com·1d
🔍AI Interpretability
Weak-To-Strong Generalization
lesswrong.com·20h
∘Category Theory
Get Ready for Clojure, GPU, and AI in 2026 with CUDA 13.0
dragan.rocks·3d·Discuss: Hacker News
🦀Rust
Reason About Intelligence, Not AI
lesswrong.com·3h
🔍AI Interpretability
I Wondered Why I Procrastinate Even On Things I Am "Passionate" About
lesswrong.com·7h
🌿Digital Gardens
Ohio House Bill 469
lesswrong.com·6h
🔒Homomorphic Encryption
Model welfare and open source
lesswrong.com·20h
⚡Incremental Computation
25 Que
lesswrong.com·10h
🗃️Zettelkasten
Human Values ≠ Goodness
lesswrong.com·3h
🌿Digital Gardens
Secretly Loyal AIs: Threat Vectors and Mitigation Strategies
lesswrong.com·1d
🔒Homomorphic Encryption
Brainstorming 25 Questions I Am Interested In
lesswrong.com·10h
🗃️Zettelkasten
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·2d
🎯Reinforcement Learning
2025 Unofficial LW Community Census, Request for Comments
lesswrong.com·17h
🌿Digital Gardens
An intro to the Tensor Economics blog
lesswrong.com·4d
🔒Homomorphic Encryption
You’re always stressed, your mind is always busy, you never have enough time
lesswrong.com·1d
✍Writing
Evidence on language model consciousness
lesswrong.com·1d
🔍AI Interpretability
My YC Pitch
lesswrong.com·12h
🌐Open Source