Why Corrigibility is Hard and Important [IABIED Resources]
lesswrong.com·14h

Published on September 30, 2025 12:12 AM GMT

I worked a bunch on the website for If Anyone Builds Its Online Resources. It went through a lot of revisions in the weeks before launch. 

There was a particular paragraphs I found important, which I now can’t find a link to, and I’m not sure if they got deleted in an edit pass or if they just moved around somewhere I’m failing to search for.

It came after a discussion of corrigibility, and how MIRI made a pretty concerted attempt at solving it, which involved bringing in some quite smart people and talking to people who thought it was obviously “not that hard” to specify a corrigible mind in a toy environment.

The paragraph went (something like, paraphrased …

Similar Posts

Loading similar posts...