Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Alignment
🎯 AI Alignment
alignment research, AI safety, RLHF, value alignment
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
72
posts in
10.9
ms
Designer babies. Self-improving
AI
. Are we ready for either?
🧩
Epistemics
Content type:
News
vox.com
·
1d
1 day ago
Actions for Designer babies. Self-improving AI. Are we ready for either?
umair-tareen/philosopher-council: An eleven-philosopher LLM council - ask it questions or point it at
AI-research
trends. Claude-powered deliberation through the four classical branches of philosophy. Methodology, not metaphysics.
🧠
LLMs
Content type:
Code
github.com
·
5d
5 days ago
·
r/SideProject
Actions for umair-tareen/philosopher-council: An eleven-philosopher LLM council - ask it questions or point it at AI-research trends. Claude-powered deliberation through the four classical branches of philosophy. Methodology, not metaphysics.
A Unifying Lens on
Reward
Uncertainty in
RLHF
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for A Unifying Lens on Reward Uncertainty in RLHF
Guardian Angels: LLM Personalization for Productivity and Security
📊
AI Monitoring
gwern.net
·
4d
4 days ago
·
Hacker News
Actions for Guardian Angels: LLM Personalization for Productivity and Security
High Dynamic Range DIY Air Testing
🧑💻
Indie Hackers
jefftk.com
·
2d
2 days ago
Actions for High Dynamic Range DIY Air Testing
Coelho Mollo and Millière: The Vector Grounding Problem
🧠
LLMs
philosophyofbrains.com
·
6d
6 days ago
Actions for Coelho Mollo and Millière: The Vector Grounding Problem
Neglected Basics of
AI
Alignment
🧠
LLMs
lesswrong.com
·
4d
4 days ago
Actions for Neglected Basics of AI Alignment
Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance
OpenClaw Won: How Big Tech Adopted the
AI
Agent
📊
AI Monitoring
thelettertwo.com
·
3d
3 days ago
Actions for OpenClaw Won: How Big Tech Adopted the AI Agent
Finding
Inner
Stillness at the Jinmandir
💡
Framework Thinking
srmdwpsitelive.kinsta.cloud
·
3d
3 days ago
Actions for Finding Inner Stillness at the Jinmandir
(VERY PARTIAL) CROSSPOST: ALEX HEATH: SubStack Is Opening Up to
AI
: Interviewing CEO Chris Best
🚀
Startups
Content type:
News
Content type:
Blog
braddelong.substack.com
·
6d
6 days ago
·
Substack
Actions for (VERY PARTIAL) CROSSPOST: ALEX HEATH: SubStack Is Opening Up to AI: Interviewing CEO Chris Best
A Regret Minimization Framework on Preference Learning in Large Language
Models
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for A Regret Minimization Framework on Preference Learning in Large Language Models
A Mike's-Eye View of ARC's
Research
⚙️
AI Infrastructure
lesswrong.com
·
1d
1 day ago
Actions for A Mike's-Eye View of ARC's Research
SLUUG Talk: Demystifying Large Language
Models
on Linux
🧠
LLMs
Content type:
Code
github.com
·
4d
4 days ago
·
DEV
Actions for SLUUG Talk: Demystifying Large Language Models on Linux
SecureBio Detection is Hiring Software Engineers
🧑💻
Indie Hackers
jefftk.com
·
6d
6 days ago
Actions for SecureBio Detection is Hiring Software Engineers
Representation-Aware Advantage Estimation: Your
Reward
Model
Provides More Than A
Scalar
Output
🧠
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Representation-Aware Advantage Estimation: Your Reward Model Provides More Than A Scalar Output
Iliad is Hiring
🔍
GEO
lesswrong.com
·
4d
4 days ago
Actions for Iliad is Hiring
Hidden Consensus:Preference-Validity Compression in Human Feedback
🧠
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Hidden Consensus:Preference-Validity Compression in Human Feedback
Learnings from starting an
AI
safety
research
team
📊
AI Monitoring
lesswrong.com
·
5d
5 days ago
Actions for Learnings from starting an AI safety research team
Trajectory Geometry of Transformer Representations Across Layers
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Trajectory Geometry of Transformer Representations Across Layers
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help