Economics and Transformative AI (by Tom Cunningham)
lesswrong.comยท1d
โšกIncremental Computation
Flag this post
Evidence on language model consciousness
lesswrong.comยท1d
๐Ÿ—ƒ๏ธZettelkasten
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.comยท2d
๐ŸŽฏReinforcement Learning
Flag this post
Weak-To-Strong Generalization
lesswrong.comยท20h
โˆ˜Category Theory
Flag this post
Model welfare and open source
lesswrong.comยท20h
โšกIncremental Computation
Flag this post
Agentic AI and Security
martinfowler.comยท5dยท
๐Ÿš€MLOps
Flag this post
Secretly Loyal AIs: Threat Vectors and Mitigation Strategies
lesswrong.comยท1d
๐Ÿ”ขHomomorphic Encryption
Flag this post
I Wondered Why I Procrastinate Even On Things I Am "Passionate" About
lesswrong.comยท7h
๐ŸŒฟDigital Gardens
Flag this post
Why Civilizations Are Unstable (And What This Means for AI Alignment)
lesswrong.comยท4d
โšกIncremental Computation
Flag this post
A toy model of corrigibility
lesswrong.comยท4h
โšกIncremental Computation
Flag this post
LLM Hallucinations: An Internal Tug of War
lesswrong.comยท3d
โšกIncremental Computation
Flag this post
An intro to the Tensor Economics blog
lesswrong.comยท4d
๐Ÿ”ขHomomorphic Encryption
Flag this post
Doom from a Solution to the Alignment Problem
lesswrong.comยท6h
โšกIncremental Computation
Flag this post
2025 Unofficial LW Community Census, Request for Comments
lesswrong.comยท17h
๐ŸŒฟDigital Gardens
Flag this post