My YC Pitch
lesswrong.com·12h
Open Source
Economics and Transformative AI (by Tom Cunningham)
lesswrong.com·1d
AI Interpretability
Model welfare and open source
lesswrong.com·20h
Incremental Computation
A toy model of corrigibility
lesswrong.com·4h
Incremental Computation
Doom from a Solution to the Alignment Problem
lesswrong.com·6h
Incremental Computation
Weak-To-Strong Generalization
lesswrong.com·20h
Category Theory
I Wondered Why I Procrastinate Even On Things I Am "Passionate" About
lesswrong.com·7h
Digital Gardens
Agentic AI and Security
MLOps
Secretly Loyal AIs: Threat Vectors and Mitigation Strategies
lesswrong.com·1d
Homomorphic Encryption
Centralization begets stagnation
lesswrong.com·2d
Open Source
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·2d
Reinforcement Learning
Reason About Intelligence, Not AI
lesswrong.com·3h
AI Interpretability
Ohio House Bill 469
lesswrong.com·6h
Embedded Systems
Decision theory when you can't make decisions
lesswrong.com·1d
Reinforcement Learning
FTL travel and scientific realism
lesswrong.com·16h
Observability
Why Civilizations Are Unstable (And What This Means for AI Alignment)
lesswrong.com·4d
AI Interpretability
An intro to the Tensor Economics blog
lesswrong.com·4d
Homomorphic Encryption
Agentic Monitoring for AI Control
lesswrong.com·6d
Observability