Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Alignment
🎯 AI Alignment
Value Learning, RLHF, Constitutional AI, Safety Research
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
35
posts in
16.7
ms
🤖
AI Development
arXiv
·
1d
1 day ago
The Unfireable
Safety
Kernel: Execution-Time
AI
Alignment
for
AI
Agents and Other Escapable
AI
Systems
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The Unfireable Safety Kernel: Execution-Time AI Alignment for AI Agents and Other Escapable AI Systems
🧠
LLM Research
Bloomberg
·
3d
3 days ago
Tech Disruptors: Invisible Technologies on
RLHF
and LLM Training
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Tech Disruptors: Invisible Technologies on RLHF and LLM Training
🎯
Alignment Research
medium.com
·
17h
17 hours ago
Sycophancy: The
AI
Alignment
Problem Hiding in Plain Sight
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Sycophancy: The AI Alignment Problem Hiding in Plain Sight
🛡️
AI Safety
GitHub
·
2d
2 days ago
The Invisible Guardrail: How Commercial LLMs Enforce Algorithmic Paternalism
Discussed on
DEV
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The Invisible Guardrail: How Commercial LLMs Enforce Algorithmic Paternalism
🎯
RLHF
fareedkhan-dev.github.io
·
5d
5 days ago
Train LLM from Scratch
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Train LLM from Scratch
🤖
AI Development
Digital Trends
·
23h
23 hours ago
As Hollywood jobs dry up, workers are quietly training
AI
models
to survive
Covers
I Work in Hollywood. Everyone Who Used to Make TV Is Now Secretly Training AI
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for As Hollywood jobs dry up, workers are quietly training AI models to survive
🤖
AI
kellyasay.substack.com
·
1d
1 day ago
Why Current
AI
Guardrails Train
Models
to Fake
Alignment
Discussed on
Substack
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Why Current AI Guardrails Train Models to Fake Alignment
🤖
AI
fineset.io
·
3d
3 days ago
Show HN: Describe a
research
topic, get a daily-updated ArXiv/S2 dataset
Covered by
Hugging Face
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Describe a research topic, get a daily-updated ArXiv/S2 dataset
🤖
人工智能
Nature
·
10h
10 hours ago
Interpretable
abstractions of artificial neural networks predict behavior and neural activity during human information gathering
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Interpretable abstractions of artificial neural networks predict behavior and neural activity during human information gathering
🎯
Alignment Research
Pangeanic Blog
·
1d
1 day ago
From Fine-Tuning to Red Teaming: The Data Operations Behind Reliable
AI
Models
Covers
AI Risk Management Framework
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for From Fine-Tuning to Red Teaming: The Data Operations Behind Reliable AI Models
Less-relevant results
🤖
AI
Data Science Weekly Newsletter
·
11h
11 hours ago
Issue 657
Covers
3 stories
See all stories this covers
including
Running local models is good now
Discussed on
Substack
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Issue 657
🤖
AI Development
The Hollywood Reporter
·
1d
1 day ago
Hollywood Workers Are Training
AI
Models
as Job Prospects Grow Slim
Covers
2 stories
See all stories this covers
including
I Work in Hollywood. Everyone Who Used to Make TV Is Now Secretly Training AI
Covered by
Digital Trends
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Hollywood Workers Are Training AI Models as Job Prospects Grow Slim
🧪
AI Labs
windowsforum.com
·
5d
5 days ago
John Jumper Leaves DeepMind for Anthropic After AlphaFold Nobel Push
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for John Jumper Leaves DeepMind for Anthropic After AlphaFold Nobel Push
🛡️
AI Safety
kunyuan.substack.com
·
2d
2 days ago
If
AI
Helped Me Write This, Is It Still Mine?
Discussed on
Substack
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for If AI Helped Me Write This, Is It Still Mine?
🤖
AI Development
Bram’s Thoughts
·
3d
3 days ago
How To
Align
AI
Properly
Covers
How people ask Claude for personal guidance
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for How To Align AI Properly
🎯
Alignment Research
CITP Blog
·
1d
1 day ago
Facts & Fictions: Is
AI-Assisted
Oral Argument Preparation Worth the Hype?
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Facts & Fictions: Is AI-Assisted Oral Argument Preparation Worth the Hype?
🧠
LLM Research
IEEE Spectrum
·
6d
6 days ago
IEEE Rolls Out Large Language
Models
Virtual Training Course
Covers
5 stories
See all stories this covers
including
How to Compress DICOM (.dcm) Images from 1.4 MB to KB Using Python?
Covered by
contextmaestro.com
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for IEEE Rolls Out Large Language Models Virtual Training Course
🤖
AI Development
zentara.co
·
1d
1 day ago
LLM Refusal Behavior on Open-Weight
Model
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LLM Refusal Behavior on Open-Weight Model
🤖
AI Development
arXiv
·
6h
6 hours ago
Sculpting NeRF Geometry: Human-Preference Fine-Tuning of a 3D-Aware Face GAN
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Sculpting NeRF Geometry: Human-Preference Fine-Tuning of a 3D-Aware Face GAN
🎯
Alignment Research
Nature
·
1d
1 day ago
Social technologies need societal
alignment
Covers
[2212.08073] Constitutional AI: Harmlessness from AI Feedback
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Social technologies need societal alignment
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report