Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·2d
Reinforcement Learning
Why Civilizations Are Unstable (And What This Means for AI Alignment)
lesswrong.com·4d
AI Interpretability
When Will AI Transform the Economy?
lesswrong.com·5d
AI Interpretability
Agentic AI and Security
MLOps
AISLE discovered three new OpenSSL vulnerabilities
lesswrong.com·3d
Rust
Seattle Secular Solstice 2025 – Dec 20th
lesswrong.com·1d
Digital Gardens
Uncertain Updates: October 2025
lesswrong.com·4d
Incremental Computation
Brainstorming 25 Questions I Am Interested In
lesswrong.com·11h
Zettelkasten
Reason About Intelligence, Not AI
lesswrong.com·3h
AI Interpretability
OpenAI Moves To Complete Potentially The Largest Theft In Human History
lesswrong.com·2d
Open Source
Agentic Monitoring for AI Control
lesswrong.com·6d
Observability
RSS feeds discovery strategies
RSS
The Memetics of AI Successionism
lesswrong.com·5d
AI Interpretability
Me consuming five different forms of media at once to minimize the chance of a thought occurring
lesswrong.com·15h
Digital Gardens
Halfhaven Digest #3
lesswrong.com·2d
RSS
Please Do Not Sell B30A Chips to China
lesswrong.com·4d
Embedded Systems
Strategy-Stealing Argument Against AI Dealmaking
lesswrong.com·1d
Reinforcement Learning
You’re always stressed, your mind is always busy, you never have enough time
lesswrong.com·1d
Writing
Summary and Comments on Anthropic's Pilot Sabotage Risk Report
lesswrong.com·3d
Observability