Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
RLHF, Reward Models, Policy, Agents
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
175033
posts in
23.7
ms
State of
RL
for
reasoning
LLMs
aweers.de
·
2d
💬
LLMs
Agile
Interception
of a Flying Target using Competitive Reinforcement Learning
arxiv.org
·
7h
🕵️
LLM Agents
From Passive Observer to Active
Critic
: Reinforcement Learning
Elicits
Process Reasoning for Robotic Manipulation
arxiv.org
·
1d
🕵️
LLM Agents
Modeling ballistic magnetization
reversals
via spin-orbit
torques
by reinforcement learning
link.aps.org
·
19h
🔥
PyTorch
Explainable
Causal Reinforcement Learning for precision
oncology
clinical workflows under real-time policy constraints
dev.to
·
13h
·
Discuss:
DEV
🤖
AI
The Hidden Feedback Loop That Makes AI Agents
Truly
Intelligent
vinitpahwa.medium.com
·
10h
🕵️
LLM Agents
Reinforcement Learning for Robotics: A Comprehensive 2025 Guide |
Abhishek
Nair
- Fractional CTO for Deep Tech & AI
padawanabhi.de
·
2d
·
Discuss:
DEV
🕵️
LLM Agents
Reinforcement
Learning
environments
and how to build them
unsloth.ai
·
4d
·
Discuss:
Hacker News
🕵️
LLM Agents
Newcomb
's
Paradox
Simulation
lesswrong.com
·
6h
🕵️
LLM Agents
Structured
Outputs
— The Type System for Agents
medium.com
·
8h
🕵️
LLM Agents
Show HN: New idea for
automatically
teaching
your agent new skills
news.ycombinator.com
·
1d
·
Discuss:
Hacker News
🕵️
LLM Agents
Rate Limit
Cascading
: The
Silent
Budget Killer in Multi-Agent Systems
dev.to
·
10h
·
Discuss:
DEV
🦀
Rust
RL agents go from
face-planting
to
parkour
when researchers keep adding network layers
the-decoder.com
·
3d
🤖
Transformers
From Advisory to Autonomous: What It Actually Takes to Make AI Agents Safe in
Manufacturing
ERP
zeehub.ai
·
2h
·
Discuss:
Hacker News
🕵️
LLM Agents
How Many Agents Are Too Many? The Hidden Cost of Multi-Agent Systems —
Anannya
Roy
Chowdhury
at AI Engineer Melbourne 2026
webdirections.org
·
8h
🕵️
LLM Agents
Launch an autonomous AI agent with
sandboxed
execution in 2
lines
of code
amaiya.github.io
·
10h
·
Discuss:
Hacker News
🤖
AI
21 Reinforcement Learning (
RL
) Concepts Explained
Simply
newsletter.systemdesign.one
·
3d
🕵️
LLM Agents
Agentic AI in Action — Part 14 - Building a Store Performance
Monitoring
Agent using LLMs and
Maps
pub.towardsai.net
·
6h
🕵️
LLM Agents
AI vs. Machine Learning: Understanding the
Differences
and Real-World
Applications
databricks.com
·
23h
🧠
Machine Learning
The
Cutting
Edge: Agents, Reasoning Models &
Multimodal
AI
medium.com
·
15h
🕵️
LLM Agents
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help