Solving a problem with mindware
lesswrong.com·9h
🗃️Zettelkasten
Red Heart
lesswrong.com·7h
🦀Rust
A toy model of corrigibility
lesswrong.com·1d
⚡Incremental Computation
To improve Rationality, create Situations
lesswrong.com·8h
🗃️Zettelkasten
Weak-To-Strong Generalization
lesswrong.com·1d
∘Category Theory
What's up with Anthropic predicting AGI by early 2027?
lesswrong.com·7h
⚡Incremental Computation
You think you are in control?
lesswrong.com·6h
🏠Self-Hosting
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·3d
🎯Reinforcement Learning
How Powerful AI Gets Cheap
lesswrong.com·7h
🔢Homomorphic Encryption
High-Resistance Systems to Change: Can a Political Strategy Apply to Personal Change?
lesswrong.com·5h
⚡Incremental Computation
FTL travel and scientific realism
lesswrong.com·1d
∘Category Theory
Model welfare and open source
lesswrong.com·1d
⚡Incremental Computation
Summary and Comments on Anthropic's Pilot Sabotage Risk Report
lesswrong.com·4d
🔍AI Interpretability