🧭 LLM Alignment
AI alignment, RLHF, model behavior, interpretability
Scoured 155 posts in 6.4 ms
Stack Overflow didn't just help AI learn to code
🛡️ AI Safety · zozo123.github.io · 3 days ago · Hacker News
A free diagnostic for the Claude Certified Architect exam
🛡️ AI Safety · Content type: Discussion, Tutorial · claudecertifiedarchitects.com · 1 day ago · Hacker News
Less-relevant results
Is the Space Pope Reptilian?
🛡️ AI Safety · Content type: News · tearsinrain.ai · 14 hours ago · Hacker News
Import AI 460: Reward hacking society, RSI data from Anthropic; and RL-based quadcopter racing
🤖 AGI · Content type: News, Blog · importai.substack.com · 2 days ago · Substack
The crucial human component in computing and AI
🛡️ AI Safety · Content type: Academic · news.mit.edu · 5 days ago
Learning to Attack and Defend: Adaptive Red Teaming of Language Models via GRPO
🛡️ AI Safety · Content type: Academic · arxiv.org · 1 day ago
Sequent: scale and automation for higher confidence in alignment
🤖 AGI · lesswrong.com · 12 hours ago
AWS Destroyed the Value Proposition for Bedrock
🦋 ATProto · Content type: Blog · securosis.com · 16 hours ago
Scale Robot Reinforcement Learning with NVIDIA Isaac Lab on Amazon SageMaker AI
🎭 AI Simulators · Content type: Blog · aws.amazon.com · 1 day ago
Nvidia Nemotron 3 Ultra
🎭 AI Simulators · research.nvidia.com · 6 days ago · Hacker News
Breaking free of a single datacenter: Practical geo-distributed AI operations with the k0smos platforms
🛡️ AI Safety · Content type: Blog · cncf.io · 2 days ago
umair-tareen/philosopher-council: An eleven-philosopher LLM council - ask it questions or point it at AI-research trends. Claude-powered deliberation through the four classical branches of philosophy. Methodology, not metaphysics.
🎭 AI Simulators · Content type: Code · github.com · 5 days ago · r/SideProject
The Stoic Path to Actual AI Safety: Three Practical Steps for Industry and Individuals
🛡️ AI Safety · oodaloop.com · 2 days ago
Raize Orion Multi-framework GRC with anchored NIS2 reporting clocks
🛡️ AI Safety · raizehq.dev · 4 days ago · Hacker News
DOG-DPO: Dynamic Optimization in Geometry for Safety Alignment
🛡️ AI Safety · Content type: Academic · arxiv.org · 1 day ago
Op Ed: Consultant Tony O’Connor On The Agentic Trojan Horse
🛡️ AI Safety · thecompanydime.com · 2 days ago
‘I don’t want my children to grow up in a broken family’: Abused husbands in S’pore who are unseen
🔍 Epistemics · straitstimes.com · 4 days ago · r/singapore
The Three Filters: Why Almost Every Plan to Survive ASI Fails Miserably
🛡️ AI Safety · lesswrong.com · 18 hours ago
X-VPN proves its privacy credentials with new independent no-logs audit
🛡️ AI Safety · Content type: News · techradar.com · 2 days ago
SecureBio Detection is Hiring Software Engineers
🛡️ AI Safety · jefftk.com · 5 days ago