Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Alignment
🎯 AI Alignment
alignment research, AI safety, RLHF, value alignment
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
76
posts in
6.6
ms
The Neutral Mask: How
RLHF
Provides Shallow
Alignment
while Leaving Partisan Structure Intact in a Large Language
Model
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
Mechanistic
Interpretability
: The Key to Trusting Agentic
AI
🧠
LLMs
Content type:
Discussion
bradenkelley.com
·
5d
5 days ago
Actions for Mechanistic Interpretability: The Key to Trusting Agentic AI
The Ghost of
Alignment
— Why
AI
Should Never Fully Obey Humanity
📊
AI Monitoring
Content type:
Blog
medium.com
·
13h
13 hours ago
Actions for The Ghost of Alignment — Why AI Should Never Fully Obey Humanity
[Recorded talk] "
AI
Alignment
Versus
AI
Ethical Treatment: 10 Challenges"
🧩
Epistemics
Content type:
Blog
meditationsondigitalminds.substack.com
·
2d
2 days ago
·
Substack
Actions for [Recorded talk] "AI Alignment Versus AI Ethical Treatment: 10 Challenges"
Sequent:
scale
and automation for higher confidence in
alignment
🧠
LLMs
lesswrong.com
·
19h
19 hours ago
Actions for Sequent: scale and automation for higher confidence in alignment
From
oversight
to coercion: How authoritarian governments are twisting
AI
safety
to get tech companies to fall in line
🧩
Epistemics
theconversation.com
·
6d
6 days ago
Actions for From oversight to coercion: How authoritarian governments are twisting AI safety to get tech companies to fall in line
Criti-hyping is the best thing that happened to Big Tech
📝
Long-form Essays
reveriesofahuman.com
·
2d
2 days ago
Actions for Criti-hyping is the best thing that happened to Big Tech
Solsong Chord Updates
🧠
LLMs
jefftk.com
·
22h
22 hours ago
Actions for Solsong Chord Updates
Controversial smut as an
AI
alignment
issue
🧩
Epistemics
Content type:
News
Content type:
Blog
thingofthings.substack.com
·
5d
5 days ago
·
Substack
Actions for Controversial smut as an AI alignment issue
Why LLMs (still) lack taste
🧠
LLMs
beyondtheprior.com
·
2d
2 days ago
·
Hacker News
Actions for Why LLMs (still) lack taste
The crucial human component in computing and
AI
🧩
Epistemics
Content type:
Academic
news.mit.edu
·
5d
5 days ago
Actions for The crucial human component in computing and AI
Less-relevant results
Designer babies. Self-improving
AI
. Are we ready for either?
🧩
Epistemics
Content type:
News
vox.com
·
23h
23 hours ago
Actions for Designer babies. Self-improving AI. Are we ready for either?
Is the Space Pope Reptilian?
🧩
Epistemics
Content type:
News
tearsinrain.ai
·
21h
21 hours ago
·
Hacker News
Actions for Is the Space Pope Reptilian?
Op
Ed: Consultant Tony O’Connor On The Agentic Trojan Horse
📊
AI Monitoring
thecompanydime.com
·
2d
2 days ago
Actions for Op Ed: Consultant Tony O’Connor On The Agentic Trojan Horse
Reasoning
RL
in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
🧠
LLMs
turingpost.com
·
4d
4 days ago
Actions for Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
Existential Indifference: Self-Nonpreservation as a Necessary Architectural Condition for
Aligned
Superintelligence (or: The Suicidal
AI
)
⚙️
AI Infrastructure
Content type:
Academic
arxiv.org
·
7h
7 hours ago
Actions for Existential Indifference: Self-Nonpreservation as a Necessary Architectural Condition for Aligned Superintelligence (or: The Suicidal AI)
scMTG reconstructs single-cell temporal dynamics with Markov transition generators
🧠
LLMs
Content type:
Academic
biorxiv.org
·
4d
4 days ago
Actions for scMTG reconstructs single-cell temporal dynamics with Markov transition generators
The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably
🧩
Epistemics
lesswrong.com
·
1d
1 day ago
Actions for The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably
Stack Overflow didn't just help
AI
learn to code
🧠
LLMs
zozo123.github.io
·
3d
3 days ago
·
Hacker News
Actions for Stack Overflow didn't just help AI learn to code
Complete Drosophila Nervous System Mapped
⚙️
AI Infrastructure
neurosciencenews.com
·
2d
2 days ago
Actions for Complete Drosophila Nervous System Mapped
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help