Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·2d
🎯Reinforcement Learning
Why Civilizations Are Unstable (And What This Means for AI Alignment)
lesswrong.com·4d
🔍AI Interpretability
When Will AI Transform the Economy?
lesswrong.com·5d
🔍AI Interpretability
Agentic AI and Security
martinfowler.com·5d
🚀MLOps
AISLE discovered three new OpenSSL vulnerabilities
lesswrong.com·3d
🦀Rust
Seattle Secular Solstice 2025 – Dec 20th
lesswrong.com·1d
🌿Digital Gardens
Uncertain Updates: October 2025
lesswrong.com·4d
⚡Incremental Computation
Brainstorming 25 Questions I Am Interested In
lesswrong.com·11h
🗃️Zettelkasten
Reason About Intelligence, Not AI
lesswrong.com·3h
🔍AI Interpretability
OpenAI Moves To Complete Potentially The Largest Theft In Human History
lesswrong.com·2d
🌐Open Source
Agentic Monitoring for AI Control
lesswrong.com·6d
👁️Observability
RSS feeds discovery strategies
blog.burkert.me·6d
Discuss: Hacker News
📡RSS
The Memetics of AI Successionism
lesswrong.com·5d
🔍AI Interpretability
Me consuming five different forms of media at once to minimize the chance of a thought occurring
lesswrong.com·15h
🌿Digital Gardens
Halfhaven Digest #3
lesswrong.com·2d
📡RSS
Please Do Not Sell B30A Chips to China
lesswrong.com·4d
🔌Embedded Systems
Strategy-Stealing Argument Against AI Dealmaking
lesswrong.com·1d
🎯Reinforcement Learning
You’re always stressed, your mind is always busy, you never have enough time
lesswrong.com·1d
✍Writing
Summary and Comments on Anthropic's Pilot Sabotage Risk Report
lesswrong.com·3d
👁️Observability