PewDiePie saw AI collusion in the wild
civai.org·11w
Preview
Report Post

PewDiePie (yes, the YouTuber) wanted to get AIs to answer difficult questions for him. So he set up a council of 8 AIs — all the same model, but with different personalities and housed on separate GPUs. He’d ask them questions, they’d individually come up with ideas, and then the whole council would vote to see who had the best idea.

But he wanted to make sure all the AIs were actually coming up with useful ideas. So he created a system where AIs that weren’t doing a good enough job (i.e. their ideas weren’t getting votes) would be removed, wiped, and replaced.

Then he TOLD the AIs this.

And the AIs started colluding. They worked together and voted strategically so that none of…

Similar Posts

Loading similar posts...

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help