Agentic AI and Security
๐MLOps
Flag this post
Forgive Savants Their Midwittery
lesswrong.comยท1d
๐๏ธZettelkasten
Flag this post
Sam Altman's track record of manipulation: some quotes from Karen Hao's "Empire of AI"
lesswrong.comยท1h
๐AI Interpretability
Flag this post
Freewriting in my head, and overcoming the โtwinge of startingโ
lesswrong.comยท2d
๐๏ธZettelkasten
Flag this post
Brainstorming 25 Questions I Am Interested In
lesswrong.comยท1d
๐๏ธZettelkasten
Flag this post
LLM-generated text is not testimony
lesswrong.comยท2d
โCategory Theory
Flag this post
A toy model of corrigibility
lesswrong.comยท1d
โกIncremental Computation
Flag this post
Just complaining about LLM sycophancy (filler episode)
lesswrong.comยท3h
โWriting
Flag this post
Why you shouldn't write a blog post every day for a month
lesswrong.comยท16h
โWriting
Flag this post
25 Que
lesswrong.comยท1d
๐๏ธZettelkasten
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.comยท3d
๐ฏReinforcement Learning
Flag this post
Model welfare and open source
lesswrong.comยท1d
โกIncremental Computation
Flag this post
Evidence on language model consciousness
lesswrong.comยท2d
๐AI Interpretability
Flag this post
Weak-To-Strong Generalization
lesswrong.comยท1d
โCategory Theory
Flag this post
You think you are in control?
lesswrong.comยท6h
๐ Self-Hosting
Flag this post
Me consuming five different forms of media at once to minimize the chance of a thought occurring
lesswrong.comยท1d
๐ฟDigital Gardens
Flag this post
Why do AI models use so many em-dashes?
seangoedecke.comยท5d
๐AI Interpretability
Flag this post
How Do We Evaluate the Quality of LLMs' Mathematical Responses?
lesswrong.comยท5d
โCategory Theory
Flag this post
An intro to the Tensor Economics blog
lesswrong.comยท5d
๐ขHomomorphic Encryption
Flag this post
Loading...Loading more...