alignmentforum.org, 7 weeks ago

Risk from fitness-seeking AIs: mechanisms and mitigations

AI Alignment Forum (alignmentforum.org)

A community blog devoted to technical AI alignment research
