Philipp D. Dubach
4 weeks ago
Aschenbrenner's Receipts
Discussed on Hacker News
Covers 6 related stories
arxiv.org · 78 weeks ago
Alignment faking in large language models
anthropic.com · 48 weeks ago
Anthropic and the Department of Defense to advance responsible AI in defense operations
Discussed on Hacker News
api-docs.deepseek.com · 73 weeks ago
Claimed DeepSeek-R1-Distill results largely fail to replicate
Discussed on r/LocalLLaMA
arxiv.org · 79 weeks ago
Frontier Models are Capable of In-context Scheming
Discussed on Hacker News
Epoch AI · 27 weeks ago
GPQA Diamond | Epoch AI
bloomberg.com · 11 weeks ago
Microsoft in Talks With Chevron, Engine No. 1 Over $7 Billion Texas Power Plant
Discussed on Hacker News