Existential Risk

Feeds to Scour
SubscribedAll
Scoured 30 posts in 10.5 ms

Advanced AI Safety Addendum

 🛡️AI Safety
cloud.google.com··Hacker News

Instrumental convergence and power-seeking

 🔭Longtermism  Content type: Academic
arxiv.org·

Solving the Worlds Hardest Problems with AI

 🎓Advanced content

Claude Fable 5 and new AI safety fables

 🎭Claude  Content type: News
interconnects.ai··Hacker News

Paving the way for agents in biology

 🛡️Content Moderation

Anthropic urges ‘temporary pause’ on AI development to discuss risks

 🎭Claude  Content type: News

Diffuse AI Control on Fuzzy Tasks

 🛡️AI Safety  Content type: Academic
arxiv.org·

Pareto-Guided Teacher Alignment for Fair Personalized Text Generation

 🎯Alignment Research  Content type: Academic
arxiv.org·

OpenAI Offers A New Policy Blueprint

 ⚠️Information Hazards  Content type: News  Content type: Blog

Mankirat47/Dao-Heart-3.13: An inspectable, symbolic value governance layer for AI, simulate then commit guards for warmth, agency, identity, and honesty, with falsifiable benchmarks.

 🛡️AI Safety  Content type: Code
github.com··Hacker News

Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs

 🎭Claude
latent.space··Hacker News

Sycophantic Praise: Evaluating Excessive Praise in Language Models

 📋Text Quality  Content type: Academic
arxiv.org·

The lawsuits that could give AI its ‘Big Tobacco’ moment

 ⚖️Tech Policy
politico.com
··Hacker News
Less-relevant results

Amazon employees ask Seattle to put the brakes on new data centers

 🛡️Content Moderation  Content type: News

Overview of Canada’s National Artificial Intelligence Strategy: AI for All

 🛡️AI Safety

A Geometric View for Understanding Concept Learning and Neuron Interpretation in Sparse Autoencoders

 🔍AI Interpretability  Content type: Academic
arxiv.org·

Epiplexity

 🎯Alignment Research  Content type: Blog
andys.blog··Hacker News

Trump wants the American public to own a piece of OpenAI. Nobody knows how that would work.

 ⚖️AI Governance  Content type: News
thenextweb.com·

RiskNet: A large-scale dataset of AI risk incidents from news with alignment and multi-dimensional annotations

 🛡️AI Safety  Content type: Academic
arxiv.org·

teia-igo-vs-claude-opus-4.8/README.en.md at main · joseteiadirector/teia-igo-vs-claude-opus-4.8

 🎭Claude  Content type: Code
github.com··Hacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help