Evidence on language model consciousness
lesswrong.comยท3d
๐AI Interpretability
Flag this post
Ink without haven
lesswrong.comยท3d
โWriting
Flag this post
Reflections on 4 years of meta-honesty
lesswrong.comยท2d
๐ฎMessage Queues
Flag this post
Economics and Transformative AI (by Tom Cunningham)
lesswrong.comยท2d
๐AI Interpretability
Flag this post
AISLE discovered three new OpenSSL vulnerabilities
lesswrong.comยท5d
๐ฆRust
Flag this post
Freewriting in my head, and overcoming the โtwinge of startingโ
lesswrong.comยท3d
๐๏ธZettelkasten
Flag this post
Build the life you actually want
lesswrong.comยท11h
๐ฟDigital Gardens
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.comยท4d
๐ฏReinforcement Learning
Flag this post
Me consuming five different forms of media at once to minimize the chance of a thought occurring
lesswrong.comยท2d
๐ฟDigital Gardens
Flag this post
Decision theory when you can't make decisions
lesswrong.comยท2d
๐ฏReinforcement Learning
Flag this post
Anthropic's Pilot Sabotage Risk Report
lesswrong.comยท4d
๐๏ธObservability
Flag this post
Why Is Printing So Bad?
lesswrong.comยท2d
โWriting
Flag this post
LLM Hallucinations: An Internal Tug of War
lesswrong.comยท5d
๐AI Interpretability
Flag this post
Why Civilizations Are Unstable (And What This Means for AI Alignment)
lesswrong.comยท6d
๐AI Interpretability
Flag this post
An intro to the Tensor Economics blog
lesswrong.comยท6d
๐ขHomomorphic Encryption
Flag this post
Loading...Loading more...