🎮 Reinforcement Learning
RL, AI Agents, Game Playing, Policy Optimization
Performance Variation in Deep Reinforcement Learning
🧠 AI Research · Content type: Academic · arxiv.org · 3 days ago
How to Implement a Model-Free RL Algorithm: A Step-by-Step Guide
🧠 AI Research · Content type: Blog · ujangriswanto08.medium.com · 1 hour ago
Q-Learning (Reinforcement learning): Bellman Equation, Markov Decision Processes, Q-Values, and…
🧠 AI Research · Content type: Blog · medium.com · 2 days ago
Researchers develop AI-powered railway control system for efficient urban train operation
🧠 AI Research · techxplore.com · 17 hours ago
Edge AI enabled MIMO MC-CDMA for 6G optimizing spectrum and energy efficiency with SIC and deep reinforcement learning
🧠 AI Research · Content type: Academic · nature.com · 1 day ago
Reinforcement Learning and Optimal Control Book (RIP Dimitri Bertsekas)
🧠 AI Research · Content type: Academic · web.mit.edu · 5 days ago · Hacker News
Agents Need Work Data: A Primer on RLWD, or Reinforcement Learning on Work Data
🧠 AI Research · anjalishriva.com · 1 day ago · Hacker News
Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
🧠 AI Research · turingpost.com · 3 days ago
Some Interesting Papers on RLVR
🧠 AI Research · lesswrong.com · 1 day ago
Time-slip in AI sepsis models may inflate results, risking under- or overtreatment
🧠 AI Research · medicalxpress.com · 5 days ago
Scale Robot Reinforcement Learning with NVIDIA Isaac Lab on Amazon SageMaker AI
⚙️ AI Infrastructure · Content type: Blog · aws.amazon.com · 1 day ago
Towards End to End Motion Planning and Execution for Autonomous Underwater Vehicles Using Reinforcement Learning
🧠 AI Research · Content type: Academic · arxiv.org · 2 days ago
SimarcLabs/pybullet-swarm-sim: Python framework for simulating drone swarms with PyBullet in seconds.
🧠 AI Research · Content type: Code · github.com · 3 days ago · r/opensource
Google cofounder Sergey Brin says he uses the game of Go to explain the future of work
🧠 AI Research · Content type: News · businessinsider.com · 4 days ago
Fast and Highly Expressive Policy Learning for Offline Reinforcement Learning via Bootstrapped Flow Q-Learning
🧠 AI Research · Content type: Academic · arxiv.org · 1 day ago
École secondaire Notre-Dame-du-Sault to hold graduation on June 24
🧠 Claude · sootoday.com · 6 days ago
Core Automation co-founder Jerry Tworek jokes that Nvidia's CUDA translates to miracles in Polish
🧠 AI Research · digg.com · 6 days ago
2026 FIVB Volleyball Women's Nations League in Nanjing: Poland beats Czech Republic 3-0
🧮 Embedding Models · ecns.cn · 6 days ago
TT-DAC-PS: Twin-Target Deterministic Actor-Critic with Policy Smoothing for Optimal Trade Execution
🧠 AI Research · Content type: Academic · arxiv.org · 2 days ago
AI Agent Mastery & Coaching
🧠 Claude · ruv.io · 3 days ago