Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Reinforcement Learning
🤖 Reinforcement Learning
Agents
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
407
posts in
5.4
ms
Performance Variation in
Deep
Reinforcement
Learning
🤖
Machine Learning
Content type:
Academic
arxiv.org
·
3d
3 days ago
Actions for Performance Variation in Deep Reinforcement Learning
How to Implement a Model-Free
RL
Algorithm: A Step-by-Step Guide
🤖
AI
Content type:
Blog
ujangriswanto08.medium.com
·
14h
14 hours ago
Actions for How to Implement a Model-Free RL Algorithm: A Step-by-Step Guide
Q-Learning
(
Reinforcement
learning
): Bellman Equation, Markov Decision Processes, Q-Values, and…
🤖
Machine Learning
Content type:
Blog
medium.com
·
2d
2 days ago
Actions for Q-Learning (Reinforcement learning): Bellman Equation, Markov Decision Processes, Q-Values, and…
Researchers develop AI-powered railway control system for efficient urban train operation
🤖
Machine Learning
techxplore.com
·
1d
1 day ago
Actions for Researchers develop AI-powered railway control system for efficient urban train operation
SimarcLabs/pybullet-swarm-sim: Python framework for simulating drone swarms with PyBullet in seconds.
🤖
Robotics
Content type:
Code
github.com
·
4d
4 days ago
·
r/opensource
Actions for SimarcLabs/pybullet-swarm-sim: Python framework for simulating drone swarms with PyBullet in seconds.
Agents
Need Work Data: A Primer on RLWD, or
Reinforcement
Learning
on Work Data
🤖
Transformers
anjalishriva.com
·
2d
2 days ago
·
Hacker News
Actions for Agents Need Work Data: A Primer on RLWD, or Reinforcement Learning on Work Data
Reinforcement
Learning
and Optimal Control Book (RIP Dimitri Bertsekas)
🤖
Machine Learning
Content type:
Academic
web.mit.edu
·
6d
6 days ago
·
Hacker News
Actions for Reinforcement Learning and Optimal Control Book (RIP Dimitri Bertsekas)
Reasoning
RL
in 2026: GRPO, DPO, RLVR,
Agentic
PO
& Beyond
🤖
AI
turingpost.com
·
4d
4 days ago
Actions for Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
Deep
Reinforcement
Learning
for Adaptive Power Allocation in ISAC Systems with Mobile Target
🤖
Robotics
Content type:
Academic
arxiv.org
·
15h
15 hours ago
Actions for Deep Reinforcement Learning for Adaptive Power Allocation in ISAC Systems with Mobile Target
Space-sampled Value Decay: Forgetting Mechanisms for Non-stationary
Deep
Reinforcement
Learning
🤖
Machine Learning
Content type:
Academic
arxiv.org
·
15h
15 hours ago
Actions for Space-sampled Value Decay: Forgetting Mechanisms for Non-stationary Deep Reinforcement Learning
Multi-agent
rendezvous in fluid flows via
reinforcement
learning
🤖
Robotics
Content type:
Academic
arxiv.org
·
15h
15 hours ago
Actions for Multi-agent rendezvous in fluid flows via reinforcement learning
TT-DAC-PS: Twin-Target Deterministic
Actor-Critic
with
Policy
Smoothing for Optimal Trade Execution
🤖
AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for TT-DAC-PS: Twin-Target Deterministic Actor-Critic with Policy Smoothing for Optimal Trade Execution
Deterministic
Policy
Gradient for
Learning
Equilibrium in Time-Inconsistent Control Problems
🤖
AI
Content type:
Academic
arxiv.org
·
15h
15 hours ago
Actions for Deterministic Policy Gradient for Learning Equilibrium in Time-Inconsistent Control Problems
Towards End to End Motion Planning and Execution for Autonomous Underwater Vehicles Using
Reinforcement
Learning
🤖
Robotics
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Towards End to End Motion Planning and Execution for Autonomous Underwater Vehicles Using Reinforcement Learning
Improving Generalization and Data Efficiency with Diffusion in Offline
Multi-agent
RL
🗄️
Vector Databases
Content type:
Academic
arxiv.org
·
15h
15 hours ago
Actions for Improving Generalization and Data Efficiency with Diffusion in Offline Multi-agent RL
Fast and Highly Expressive
Policy
Learning
for Offline
Reinforcement
Learning
via Bootstrapped Flow
Q-Learning
🗣️
Speech Recognition
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Fast and Highly Expressive Policy Learning for Offline Reinforcement Learning via Bootstrapped Flow Q-Learning
Phi-Actor-Critic
: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria
🤖
AI
Content type:
Academic
arxiv.org
·
15h
15 hours ago
Actions for Phi-Actor-Critic: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria
Structure-Conditioned
Actor-Critic
Branches for Quality-Diversity
Reinforcement
Learning
⚡
SIMD Optimization
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Structure-Conditioned Actor-Critic Branches for Quality-Diversity Reinforcement Learning
Merging model-based control with
multi-agent
reinforcement
learning
for
multi-agent
cooperative teaming strategies
🤖
Robotics
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Merging model-based control with multi-agent reinforcement learning for multi-agent cooperative teaming strategies
KinematicRL: A Sim-to-Real
Reinforcement
Learning
Framework For Social Navigation With Kinodynamic Feasibility
🤖
Robotics
Content type:
Academic
arxiv.org
·
15h
15 hours ago
Actions for KinematicRL: A Sim-to-Real Reinforcement Learning Framework For Social Navigation With Kinodynamic Feasibility
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help