AIs behaving badly: An AI trained to deliberately make bad code will become bad at unrelated tasks, too
techxplore.com·2w

AIs behaving badly. Credit: Image generated by the editorial team using AI for illustrative purposes.

Artificial intelligence models that are trained to behave badly on a narrow task may generalize this behavior across unrelated tasks, such as offering malicious advice, suggests a new study. The research probes the mechanisms that cause this misaligned behavior, but further work must be done to find out why it happens and how to prevent it.

The study is published in the journal Nature.

Large language models (LLMs), such as OpenAI’s ChatGPT and Google’s Gemini, are becoming wid…
