Scour
Reinforcement learning, Post training
Scoured 102 posts in 5.8 ms
Mechanistic Analysis of Alignment Algorithms in Language Models
🤖 AI, LLM · Content type: Academic · arxiv.org · 1 day ago
Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
🤖 AI, LLM · turingpost.com · 4 days ago
Tracing Eval-Awareness Emergence Through Training of OLMo 3
🤖 AI, LLM · lesswrong.com · 1 day ago
Orpo wins new term as National Coalition leader
🤖 AI, LLM · helsinkitimes.fi · 5 days ago
Less-relevant results
The week AI infrastructure crossed from a technology story to a financial one
🤖 AI, LLM · Content type: News · mlwhiz.com · 22 hours ago
Macrodata Refiner – infrastructure for the robotics data loop
🤖 AI, LLM · macrodata.co · 12 hours ago · Hacker News
Stack Overflow didn't just help AI learn to code
🤖 AI, LLM · zozo123.github.io · 4 days ago · Hacker News
Why LLMs (still) lack taste
🤖 AI, LLM · beyondtheprior.com · 2 days ago · Hacker News
Don't let the LLM speak, just probe it (8 minute read)
🤖 AI, LLM · Content type: Blog · blog.j11y.io · 22 hours ago
A Unifying Lens on Reward Uncertainty in RLHF
🤖 AI, LLM · Content type: Academic · arxiv.org · 2 days ago
Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…
🤖 AI, LLM · Content type: Blog · medium.com · 6 days ago
Zelensky arrives in Estonia for Nordic-Baltic summit as Kyiv pushes allies ahead of key summer meetings
🤖 AI, LLM · Content type: News · kyivindependent.com · 2 days ago
Posting for authoring
🤖 AI, LLM · turingpost.com · 4 days ago
Plan-and-Verify Video Reward Reasoning with Spatio-Temporal Scene Graph Grounding
🏋 Training · Content type: Academic · arxiv.org · 18 hours ago
local AI agents for Cursor with pre-tuned marketplace/commu
🤖 AI, LLM · locaible.com · 1 day ago · Hacker News
I built a machine that turns AI papers into interactive explainers
🤖 AI, LLM · Content type: Blog · blog.skz.dev · 6 days ago
Ukraine is ready to share drone technology with Nordic and Baltic countries, Zelenskyy says
🏋 Training · the-journal.com · 2 days ago
Finland's new deportation and entry ban rules take effect on Friday
🤖 AI, LLM · helsinkitimes.fi · 17 hours ago
The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
🤖 AI, LLM · Content type: Academic · arxiv.org · 2 days ago
Clipping Businesses: Pay-Per-View Distribution, Clip Armies, View Verification
🏋 Training · trends.vc · 22 hours ago