AI Alignment, Moral Philosophy, Value Systems, Practical Wisdom
Dimensions of logical time as economic strategies
lesswrong.com·2d
Follow-up to "My Empathy Is Rarely Kind"
lesswrong.com·1d
Reinforcement Learning for Next-Gen AML: From Rules to Dynamic Decisioning
pub.towardsai.net·12h
The many paths to permanent disempowerment even with shutdownable AIs (MATS project summary for feedback)
lesswrong.com·3d
Exploration hacking: can reasoning models subvert RL?
lesswrong.com·2d
Will AGI Emerge Through Self-Generated Reward Loops?
lesswrong.com·2d
A Browser That Buys You Flowers? Opera Just Changed the Web Forever
pub.towardsai.net·22h
Loading...Loading more...