Weak-To-Strong Generalization
lesswrong.com·20h
Category Theory
Flag this post
Model welfare and open source
lesswrong.com·20h
Incremental Computation
Flag this post
Get Ready for Clojure, GPU, and AI in 2026 with CUDA 13.0
dragan.rocks·3d·
Discuss: Hacker News
🦀Rust
Flag this post
Secretly Loyal AIs: Threat Vectors and Mitigation Strategies
lesswrong.com·1d
🔍AI Interpretability
Flag this post
Human Values ≠ Goodness
lesswrong.com·3h
🌿Digital Gardens
Flag this post
A toy model of corrigibility
lesswrong.com·4h
Incremental Computation
Flag this post
An intro to the Tensor Economics blog
lesswrong.com·4d
🦀Rust
Flag this post
2025 Unofficial LW Community Census, Request for Comments
lesswrong.com·17h
🌿Digital Gardens
Flag this post
I Wondered Why I Procrastinate Even On Things I Am "Passionate" About
lesswrong.com·7h
🌿Digital Gardens
Flag this post
Agentic AI and Security
martinfowler.com·5d·
🚀MLOps
Flag this post
Freewriting in my head, and overcoming the “twinge of starting”
lesswrong.com·1d
🗃️Zettelkasten
Flag this post
Me consuming five different forms of media at once to minimize the chance of a thought occurring
lesswrong.com·14h
🌿Digital Gardens
Flag this post
Evidence on language model consciousness
lesswrong.com·1d
🔍AI Interpretability
Flag this post
Economics and Transformative AI (by Tom Cunningham)
lesswrong.com·1d
🔍AI Interpretability
Flag this post
Doom from a Solution to the Alignment Problem
lesswrong.com·6h
Incremental Computation
Flag this post
25 Que
lesswrong.com·10h
🗃️Zettelkasten
Flag this post
An Opinionated Guide to Privacy Despite Authoritarianism
lesswrong.com·4d·
Discuss: r/privacy
🏠Self-Hosting
Flag this post
Ohio House Bill 469
lesswrong.com·6h
🔌Embedded Systems
Flag this post
Decision theory when you can't make decisions
lesswrong.com·1d
🎯Reinforcement Learning
Flag this post
Model Parameters as a Steganographic Private Channel
lesswrong.com·6d
λFunctional Programming
Flag this post