LLM Hallucinations: An Internal Tug of War
lesswrong.com·3d
AI Interpretability
An intro to the Tensor Economics blog
lesswrong.com·4d
Homomorphic Encryption
Halfhaven Digest #3
lesswrong.com·2d
RSS
Model welfare and open source
lesswrong.com·20h
Incremental Computation
Me consuming five different forms of media at once to minimize the chance of a thought occurring
lesswrong.com·15h
Digital Gardens
Freewriting in my head, and overcoming the “twinge of starting”
lesswrong.com·1d
Zettelkasten
A Bayesian Explanation of Causal Models
lesswrong.com·5d
Dependent Types
Reflections on 4 years of meta-honesty
lesswrong.com·17h
Message Queues
Resolving Newcomb's Problem Perfect Predictor Case
lesswrong.com·5d
Incremental Computation
Vaccination against ASI
lesswrong.com·1d
Message Queues
Ink without haven
lesswrong.com·2d
Writing
Secretly Loyal AIs: Threat Vectors and Mitigation Strategies
lesswrong.com·1d
Homomorphic Encryption
A Sketch of Helpfulness Theory With Equivocal Principals
lesswrong.com·5d
Reinforcement Learning
Emergent Introspective Awareness in Large Language Models
lesswrong.com·3d
Zettelkasten
Ohio House Bill 469
lesswrong.com·6h
Embedded Systems
Why Civilizations Are Unstable (And What This Means for AI Alignment)
lesswrong.com·4d
AI Interpretability
On The Conservation of Rights
lesswrong.com·3d
Homomorphic Encryption