Philipp D. Dubach

Aschenbrenner's Receipts

Covers 6 stories, including Alignment faking in large language models. Discussed on Hacker News.

Alignment faking in large language models

anthropic.com

Anthropic and the Department of Defense to advance responsible AI in defense operations

Discussed on Hacker News

api-docs.deepseek.com

Claimed DeepSeek-R1-Distill results largely fail to replicate

Discussed on r/LocalLLaMA

Frontier Models are Capable of In-context Scheming

Discussed on Hacker News

GPQA Diamond | Epoch AI

Microsoft in Talks With Chevron, Engine No. 1 Over $7 Billion Texas Power Plant

Discussed on Hacker News