Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com · 3d
Reinforcement Learning
Just complaining about LLM sycophancy (filler episode)
lesswrong.com · 2h
Writing
Reason About Intelligence, Not AI
lesswrong.com · 1d
AI Interpretability
FTL travel and scientific realism
lesswrong.com · 1d
Observability
You’re always stressed, your mind is always busy, you never have enough time
lesswrong.com · 2d
Writing
Seattle Secular Solstice 2025 – Dec 20th
lesswrong.com · 2d
Digital Gardens
LLM Hallucinations: An Internal Tug of War
lesswrong.com · 4d
AI Interpretability
My YC Pitch
lesswrong.com · 1d
Open Source
Is it worrying that 95% of AI enterprise projects fail?
seangoedecke.com · 23h
On The Conservation of Rights
lesswrong.com · 4d
Rust
Reflections on 4 years of meta-honesty
lesswrong.com · 1d
Message Queues
Why Civilizations Are Unstable (And What This Means for AI Alignment)
lesswrong.com · 5d
AI Interpretability
Halfhaven Digest #3
lesswrong.com · 3d
RSS
Strategy-Stealing Argument Against AI Dealmaking
lesswrong.com · 2d
Reinforcement Learning