Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🛡️ AI Safety
AI alignment, AI safety, value alignment, AGI risk
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
59
posts in
11.3
ms
We Don't Talk Enough About The Aftermath Of Death Note
✨
Generative AI
aftermath.site
·
1d
Automated
Alignment
is Harder Than You Think
⚖️
AI Ethics
lesswrong.com
·
6d
From Descriptive to Prescriptive: Uncover the Social
Value
Alignment
of LLM-based Agents
🎮
Reinforcement Learning
arxiv.org
·
6d
Representation Engineering: a New Way of Understanding
Models
⚙️
Transformers
safe.ai
·
5d
The mistake of conflating intelligence and power
⚖️
AI Ethics
dwarkesh.com
·
4d
·
Hacker News
Ghosted Layers: Unconstrained Activation
Alignment
for Recovering Layer-Pruned LLMs
📝
LLMs
arxiv.org
·
3d
How the Grande Panda is strengthening Serbia’s car industry and where its weaknesses lie
📝
LLMs
serbianmonitor.com
·
5d
China Yuan Hits Three Year High as Markets Watch Trump Xi Summit
🔗
Deep Learning
moderndiplomacy.eu
·
6d
The Case for Evaluating
Model
Behaviors
⚖️
AI Ethics
lesswrong.com
·
9h
Trump goes to Beijing as Washington faces a changed world
🔐
Cybersecurity
mronline.org
·
5d
How China
interprets
Trump’s visit: Balance and boundaries
📝
LLMs
dailysabah.com
·
3d
At summit, China’s Xi eased tensions with Trump without giving ground
✨
Generative AI
washingtonpost.com
·
3d
'Thucydides trap': Xi warns Trump against 'mishandling' Taiwan
📝
LLMs
rediff.com
·
6d
XSearch: Explainable Code Search via Concept-to-Code
Alignment
🔍
RAG
arxiv.org
·
3d
Document-tuning instills durable animal compassion in LLMs (and generalizes to humans)
⚖️
AI Ethics
lesswrong.com
·
1h
President Xi to pay first state visit to US in more than a decade, Trump says
📝
LLMs
scmp.com
·
5d
·
r/SCMPauto
In Beijing, Trump faced America’s equal
🔐
Cybersecurity
chellaney.net
·
4d
The
safe-to-dangerous
shift is a fundamental
problem
for eval realism; but also for measuring awareness
🎮
Reinforcement Learning
lesswrong.com
·
6d
Risk
reports need to address deployment-time spread of misalignment
🔐
Cybersecurity
lesswrong.com
·
5d
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help