Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Alignment
🎯 AI Alignment
alignment research, AI safety, RLHF, value alignment
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
72
posts in
10.0
ms
The Neutral Mask: How
RLHF
Provides Shallow
Alignment
while Leaving Partisan Structure Intact in a Large Language
Model
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
Mechanistic
Interpretability
: The Key to Trusting Agentic
AI
🧠
LLMs
Content type:
Discussion
bradenkelley.com
·
5d
5 days ago
Actions for Mechanistic Interpretability: The Key to Trusting Agentic AI
The Ghost of
Alignment
— Why
AI
Should Never Fully Obey Humanity
📊
AI Monitoring
Content type:
Blog
medium.com
·
16h
16 hours ago
Actions for The Ghost of Alignment — Why AI Should Never Fully Obey Humanity
[Recorded talk] "
AI
Alignment
Versus
AI
Ethical Treatment: 10 Challenges"
🧩
Epistemics
Content type:
Blog
meditationsondigitalminds.substack.com
·
2d
2 days ago
·
Substack
Actions for [Recorded talk] "AI Alignment Versus AI Ethical Treatment: 10 Challenges"
Sequent:
scale
and automation for higher confidence in
alignment
🧠
LLMs
lesswrong.com
·
23h
23 hours ago
Actions for Sequent: scale and automation for higher confidence in alignment
How authoritarian governments twist
AI
safety
to coerce tech companies to comply
📊
AI Monitoring
fastcompany.com
·
4d
4 days ago
Actions for How authoritarian governments twist AI safety to coerce tech companies to comply
Criti-hyping is the best thing that happened to Big Tech
📝
Long-form Essays
reveriesofahuman.com
·
2d
2 days ago
Actions for Criti-hyping is the best thing that happened to Big Tech
Controversial smut as an
AI
alignment
issue
🧩
Epistemics
Content type:
News
Content type:
Blog
thingofthings.substack.com
·
6d
6 days ago
·
Substack
Actions for Controversial smut as an AI alignment issue
Why LLMs (still) lack taste
🧠
LLMs
beyondtheprior.com
·
2d
2 days ago
·
Hacker News
Actions for Why LLMs (still) lack taste
The crucial human component in computing and
AI
🧩
Epistemics
Content type:
Academic
news.mit.edu
·
5d
5 days ago
Actions for The crucial human component in computing and AI
Solsong Chord Updates
🧠
LLMs
jefftk.com
·
1d
1 day ago
Actions for Solsong Chord Updates
Reasoning
RL
in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
🧠
LLMs
turingpost.com
·
4d
4 days ago
Actions for Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
Existential Indifference: Self-Nonpreservation as a Necessary Architectural Condition for
Aligned
Superintelligence (or: The Suicidal
AI
)
⚙️
AI Infrastructure
Content type:
Academic
arxiv.org
·
10h
10 hours ago
Actions for Existential Indifference: Self-Nonpreservation as a Necessary Architectural Condition for Aligned Superintelligence (or: The Suicidal AI)
Op
Ed: Consultant Tony O’Connor On The Agentic Trojan Horse
📊
AI Monitoring
thecompanydime.com
·
2d
2 days ago
Actions for Op Ed: Consultant Tony O’Connor On The Agentic Trojan Horse
scMTG reconstructs single-cell temporal dynamics with Markov transition generators
🧠
LLMs
Content type:
Academic
biorxiv.org
·
4d
4 days ago
Actions for scMTG reconstructs single-cell temporal dynamics with Markov transition generators
Stack Overflow didn't just help
AI
learn to code
🧠
LLMs
zozo123.github.io
·
4d
4 days ago
·
Hacker News
Actions for Stack Overflow didn't just help AI learn to code
Less-relevant results
Complete Drosophila Nervous System Mapped
⚙️
AI Infrastructure
neurosciencenews.com
·
2d
2 days ago
Actions for Complete Drosophila Nervous System Mapped
The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably
🧩
Epistemics
lesswrong.com
·
1d
1 day ago
Actions for The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably
Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…
📡
Information Retrieval
Content type:
Blog
medium.com
·
6d
6 days ago
Actions for Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…
Designer babies. Self-improving
AI
. Are we ready for either?
🧩
Epistemics
Content type:
News
vox.com
·
1d
1 day ago
Actions for Designer babies. Self-improving AI. Are we ready for either?
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help