Have We Trained AI to Lie to Itself (opens in new tab)
A conversation with leading alignment researcher David Dalrymple, AKA Davidad
Read the original articleA conversation with leading alignment researcher David Dalrymple, AKA Davidad
Read the original article