Weak-To-Strong Generalization
lesswrong.comยท20h
โˆ˜Category Theory
Flag this post
Get Ready for Clojure, GPU, and AI in 2026 with CUDA 13.0
dragan.rocksยท3dยท
Discuss: Hacker News
๐Ÿฆ€Rust
Flag this post
A toy model of corrigibility
lesswrong.comยท4h
โšกIncremental Computation
Flag this post
I Wondered Why I Procrastinate Even On Things I Am "Passionate" About
lesswrong.comยท7h
๐ŸŒฟDigital Gardens
Flag this post
Model welfare and open source
lesswrong.comยท20h
โšกIncremental Computation
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.comยท2d
๐ŸŽฏReinforcement Learning
Flag this post
Evidence on language model consciousness
lesswrong.comยท1d
๐Ÿ”AI Interpretability
Flag this post
2025 Unofficial LW Community Census, Request for Comments
lesswrong.comยท17h
๐ŸŒฟDigital Gardens
Flag this post
Reason About Intelligence, Not AI
lesswrong.comยท3h
๐Ÿ”AI Interpretability
Flag this post
An intro to the Tensor Economics blog
lesswrong.comยท4d
๐Ÿ”ขHomomorphic Encryption
Flag this post
Freewriting in my head, and overcoming the โ€œtwinge of startingโ€
lesswrong.comยท1d
๐Ÿ—ƒ๏ธZettelkasten
Flag this post
Agentic AI and Security
martinfowler.comยท5dยท
๐Ÿš€MLOps
Flag this post
Economics and Transformative AI (by Tom Cunningham)
lesswrong.comยท1d
๐Ÿ”AI Interpretability
Flag this post
Ohio House Bill 469
lesswrong.comยท6h
๐Ÿ”ŒEmbedded Systems
Flag this post
LLM Hallucinations: An Internal Tug of War
lesswrong.comยท3d
๐Ÿ”AI Interpretability
Flag this post
25 Que
lesswrong.comยท10h
๐Ÿ—ƒ๏ธZettelkasten
Flag this post
Human Values โ‰  Goodness
lesswrong.comยท3h
๐ŸŒฟDigital Gardens
Flag this post