Economics and Transformative AI (by Tom Cunningham)
lesswrong.comยท1d
โกIncremental Computation
Flag this post
Evidence on language model consciousness
lesswrong.comยท1d
๐๏ธZettelkasten
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.comยท2d
๐ฏReinforcement Learning
Flag this post
Weak-To-Strong Generalization
lesswrong.comยท20h
โCategory Theory
Flag this post
Model welfare and open source
lesswrong.comยท20h
โกIncremental Computation
Flag this post
Agentic AI and Security
๐MLOps
Flag this post
Secretly Loyal AIs: Threat Vectors and Mitigation Strategies
lesswrong.comยท1d
๐ขHomomorphic Encryption
Flag this post
I Wondered Why I Procrastinate Even On Things I Am "Passionate" About
lesswrong.comยท7h
๐ฟDigital Gardens
Flag this post
Why Civilizations Are Unstable (And What This Means for AI Alignment)
lesswrong.comยท4d
โกIncremental Computation
Flag this post
A toy model of corrigibility
lesswrong.comยท4h
โกIncremental Computation
Flag this post
LLM Hallucinations: An Internal Tug of War
lesswrong.comยท3d
โกIncremental Computation
Flag this post
An intro to the Tensor Economics blog
lesswrong.comยท4d
๐ขHomomorphic Encryption
Flag this post
Doom from a Solution to the Alignment Problem
lesswrong.comยท6h
โกIncremental Computation
Flag this post
2025 Unofficial LW Community Census, Request for Comments
lesswrong.comยท17h
๐ฟDigital Gardens
Flag this post
Loading...Loading more...