Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·6d
Cybernetic Economics
Not of myself
lies2light.com·6d
philosophy
Secretly Loyal AIs: Threat Vectors and Mitigation Strategies
lesswrong.com·5d
Entropy Coding
The Fallacy of Storytelling
brajeshwar.com·6d
Entropy Coding
Debugging Despair ~> A bet about Satisfaction and Values
lesswrong.com·5d
Cybernetic Economics
Anthropic's Pilot Sabotage Risk Report
lesswrong.com·6d
Cybernetic Economics
E-Paper Clock
byronknoll.blogspot.com·6d
Sustainable Tech
Kaitlin Butts Signs to Republic Records, Readies New EP
savingcountrymusic.com·6d
git
iOS Keyboard Bugs
pxlnv.com·6d
Text-based Interfaces
The $1 hack
fanlesstech.com·6d
Sustainable Tech
Returning HTTP 404 Responses Instead of 403 for Unauthorised Access
ashallendesign.co.uk·6d
P2P Networks
Strategy-Stealing Argument Against AI Dealmaking
lesswrong.com·5d
Cybernetic Economics
Sonnet 4.5's eval gaming seriously undermines alignment evals, and this seems caused by training on alignment evals
lesswrong.com·6d
Entropy Coding
Ti Book HDD Flex Cable Woes
tinkerdifferent.com·6d
Sustainable Tech