🎮 Reinforcement Learning
RL, AI Agents, Game Playing, Policy Optimization
Scoured 175090 posts in 23.8 ms
Agile Interception of a Flying Target using Competitive Reinforcement Learning
arxiv.org · 7h · 🎓 RLHF
The Hidden Feedback Loop That Makes AI Agents Truly Intelligent
vinitpahwa.medium.com · 10h · 🤖 AI
State of RL for reasoning LLMs
aweers.de · 2d · 🎓 RLHF
Reinforcement Learning environments and how to build them
unsloth.ai · 4d · Discuss: Hacker News · 🎓 RLHF
Sample-Efficient Hypergradient Estimation for Decentralized Bi-Level Reinforcement Learning
arxiv.org · 1d · 🎓 RLHF
The Cutting Edge: Agents, Reasoning Models & Multimodal AI
medium.com · 15h · 🎭 Anthropic Claude
Reinforcement Learning for Robotics: A Comprehensive 2025 Guide | Abhishek Nair - Fractional CTO for Deep Tech & AI
padawanabhi.de · 2d · Discuss: DEV · 🎓 RLHF
Explainable Causal Reinforcement Learning for precision oncology clinical workflows under real-time policy constraints
dev.to · 13h · Discuss: DEV · 🎭 Anthropic Claude
Modeling ballistic magnetization reversals via spin-orbit torques by reinforcement learning
link.aps.org · 19h · 🎓 RLHF
RL agents go from face-planting to parkour when researchers keep adding network layers
the-decoder.com · 3d · 🎓 RLHF
How Many Agents Are Too Many? The Hidden Cost of Multi-Agent Systems — Anannya Roy Chowdhury at AI Engineer Melbourne 2026
webdirections.org · 8h · ⛓️ LangChain
Launch an autonomous AI agent with sandboxed execution in 2 lines of code
amaiya.github.io · 10h · Discuss: Hacker News · 🤖 AI
The Bounded Autonomy Spectrum: When AI Agents Should Ask Instead of Act
dev.to · 1d · Discuss: DEV · 🎓 RLHF
From Intelligent Agents to Smarter Search: Insights from Modern AI Research
youtu.be · 3d · Discuss: DEV · 🎭 Anthropic Claude
21 Reinforcement Learning (RL) Concepts Explained Simply
newsletter.systemdesign.one · 3d · 🎓 RLHF
Dynamiks.ai Introduces the Quarterback: The First Fully Autonomous AI Agent Enabling the Agentic Pipeline
prweb.com · 12h · 🤖 AI
From Advisory to Autonomous: What It Actually Takes to Make AI Agents Safe in Manufacturing ERP
zeehub.ai · 2h · Discuss: Hacker News · 📞 Function Calling
The State of Agent Engineering Report Overview
kdnuggets.com · 21h · ⛓️ LangChain
Some simple economics of AI?
marginalrevolution.com · 6h · 🤖 AI
A Layered Defense Model for Artificial Autonomous Intelligent Environments
mikail-eliyah.medium.com · 2d · 🎭 Anthropic Claude