AI Alignment, Moral Philosophy, Value Systems, Practical Wisdom
Sharpening the Shears: 8 Lessons from Garden Leave
lesswrong.com·13h
Dimensions of logical time as economic strategies
lesswrong.com·1d
TAI #163: AI Unlocking History’s Secrets; Deepmind’s Aeneas Continues A Recent Trend
pub.towardsai.net·2d
The many paths to permanent disempowerment even with shutdownable AIs (MATS project summary for feedback)
lesswrong.com·2d
Follow-up to "My Empathy Is Rarely Kind"
lesswrong.com·14h
Exploration hacking: can reasoning models subvert RL?
lesswrong.com·1d
Loading...Loading more...