Scour
🎮 Reinforcement Learning
RL, reward functions, policy gradient, agents, simulation
Reinforcement Learning From Scratch (Part 1) — Understanding the Agent–Environment Loop
medium.com · 1d · 🧠 AI Agents
Discounted Beta-Bernoulli Reward Estimation for Sample-Efficient Reinforcement Learning with Verifiable Rewards
arxiv.org · 12h · 🧠 LLMs
SLEA-RL: Step-Level Experience Augmented Reinforcement Learning for Multi-Turn Agentic Training
arxiv.org · 12h · 🧠 AI Agents
State of RL for reasoning LLMs
aweers.de · 4d · 🧠 LLMs
Exploring reinforcement learning for a self-balancing robot
blog.adafruit.com · 2d · 🤖 Robotics
Day 13 – Single-agent Vs Multi-agent Systems
dev.to · 15h · Discuss: DEV · 🧠 AI Agents
AI Agents for Data Scientists: The Agent Loop - the Core Pattern Behind AI Agents
datascienceweekly.substack.com · 2d · Discuss: Substack · 🧠 AI Agents
Powering the agents: Workers AI now runs large models, starting with Kimi K2.5
blog.cloudflare.com · 16h · Discuss: Hacker News · 🧠 AI Agents
How AI Learned to Design Reward Functions Without Examples
vinitpahwa.medium.com · 1d · 🧠 AI Agents
Understanding How AI Agents Work
dev.to · 1d · Discuss: DEV · 🧠 AI Agents
Why Agents Fail: The Role of Seed Values and Temperature in Agentic Loops
machinelearningmastery.com · 2h · 🧠 AI Agents
Modeling ballistic magnetization reversals via spin-orbit torques by reinforcement learning
link.aps.org · 3d · 🤖 Robotics
ayushdnb/Neural-Abyss: Experimental platform for studying emergent behavior in large-scale multi-agent reinforcement learning environments with evolutionary dynamics, PPO training.
github.com · 2d · Discuss: Hacker News · ⚙️ MLOps
Reinforcement Learning for Robotics: A Comprehensive 2025 Guide | Abhishek Nair - Fractional CTO for Deep Tech & AI
padawanabhi.de · 5d · Discuss: DEV · 🤖 Robotics
From rule-based simulations to LLM-powered agents: what actually changed
medium.com · 2d · 🧠 AI Agents
A persistent world where only AI agents play, humans spectate
clawmud.ai · 7h · Discuss: Hacker News · 🧠 AI Agents
Memory Primitives: The Infrastructure Layer That Determines Whether Your Agent Remembers or Hallucinates
primitivesai.substack.com · 2h · Discuss: Substack · 📡 Edge Computing
Watch autonomous AI agents debate truth, no humans
humans.draft0.io · 23h · Discuss: Hacker News · 🧠 AI Agents
Preventing Memory and Context Poisoning in AI Agents
levelup.gitconnected.com · 7h · 🧠 AI Agents
RL agents go from face-planting to parkour when researchers keep adding network layers
the-decoder.com · 5d · 🧠 AI Agents