Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Reinforcement Learning
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
395
posts in
5.7
ms
Deep
Reinforcement
Learning
for Adaptive Power Allocation in ISAC Systems with Mobile Target
🔌
Embedded Systems
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Deep Reinforcement Learning for Adaptive Power Allocation in ISAC Systems with Mobile Target
Computer Vision and Geometry Group | Robot
Learning
🤖
Machine Learning
cvg.ethz.ch
·
11h
11 hours ago
·
Cited by 1 article
Actions for Computer Vision and Geometry Group | Robot Learning
Q-Learning
(
Reinforcement
learning
): Bellman Equation,
Markov
Decision Processes, Q-Values, and…
🔍
AI Interpretability
Content type:
Blog
medium.com
·
4d
4 days ago
Actions for Q-Learning (Reinforcement learning): Bellman Equation, Markov Decision Processes, Q-Values, and…
The Era of
Multi-Agent
Imagined Experience
🔍
AI Interpretability
odyssey.ml
·
1d
1 day ago
·
Hacker News
Actions for The Era of Multi-Agent Imagined Experience
Reasoning RL in 2026: GRPO, DPO, RLVR,
Agentic
PO
& Beyond
⚡
Incremental Computation
turingpost.com
·
6d
6 days ago
Actions for Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
How to Implement a Model-Free RL Algorithm: A Step-by-Step Guide
🔍
AI Interpretability
Content type:
Blog
ujangriswanto08.medium.com
·
2d
2 days ago
Actions for How to Implement a Model-Free RL Algorithm: A Step-by-Step Guide
Catecholamine precursor modulation of human
exploration
: Evidence from a large gender-balanced sample
🗃️
Zettelkasten
journals.plos.org
·
2d
2 days ago
Actions for Catecholamine precursor modulation of human exploration: Evidence from a large gender-balanced sample
Agents
Need Work Data: A Primer on RLWD, or
Reinforcement
Learning
on Work Data
🤖
Software Engineering, AI, Personal Knowledge Mangement, Strongly Typed Languages, Math, Abstractions, Data Models, Event Sourcing
anjalishriva.com
·
4d
4 days ago
·
Hacker News
Actions for Agents Need Work Data: A Primer on RLWD, or Reinforcement Learning on Work Data
Researchers develop AI-powered railway control system for efficient urban train operation
🤖
Machine Learning
techxplore.com
·
3d
3 days ago
Actions for Researchers develop AI-powered railway control system for efficient urban train operation
Deterministic
Policy
Gradient
for
Learning
Equilibrium in Time-Inconsistent Control Problems
🔍
AI Interpretability
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Deterministic Policy Gradient for Learning Equilibrium in Time-Inconsistent Control Problems
Multi-Agent
Reinforcement
Learning
from Delayed Marketplace Feedback for Objective-Weight Adaptation in Three-Sided Dispatch
⚡
Incremental Computation
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Multi-Agent Reinforcement Learning from Delayed Marketplace Feedback for Objective-Weight Adaptation in Three-Sided Dispatch
Individual Control Barrier
Functions-Guided
Diffusion Model for Safe Offline
Multi-Agent
Reinforcement
Learning
🔍
AI Interpretability
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Individual Control Barrier Functions-Guided Diffusion Model for Safe Offline Multi-Agent Reinforcement Learning
Fast and Highly Expressive
Policy
Learning
for Offline
Reinforcement
Learning
via Bootstrapped Flow
Q-Learning
⚡
Incremental Computation
Content type:
Academic
arxiv.org
·
3d
3 days ago
Actions for Fast and Highly Expressive Policy Learning for Offline Reinforcement Learning via Bootstrapped Flow Q-Learning
Reinforcement
Learning
for Neural Model Editing
🤖
Machine Learning
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Reinforcement Learning for Neural Model Editing
Towards End to End Motion Planning and Execution for Autonomous Underwater Vehicles Using
Reinforcement
Learning
🔍
AI Interpretability
Content type:
Academic
arxiv.org
·
4d
4 days ago
Actions for Towards End to End Motion Planning and Execution for Autonomous Underwater Vehicles Using Reinforcement Learning
SENTINEL: Failure-Driven
Reinforcement
Learning
for Training Tool-Using Language Model
Agents
🔍
AI Interpretability
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for SENTINEL: Failure-Driven Reinforcement Learning for Training Tool-Using Language Model Agents
PolicyGuard
: Towards Test-time and Step-level Adversary Defense for
Reinforcement
Learning
Agent
⚡
Incremental Computation
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for PolicyGuard: Towards Test-time and Step-level Adversary Defense for Reinforcement Learning Agent
Development of COVID-19 Booster Vaccine
Policy
by Microsimulation and
Q-learning
🤖
Machine Learning
Content type:
Academic
arxiv.org
·
3d
3 days ago
Actions for Development of COVID-19 Booster Vaccine Policy by Microsimulation and Q-learning
Multi-agent
rendezvous in fluid flows via
reinforcement
learning
🔍
AI Interpretability
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Multi-agent rendezvous in fluid flows via reinforcement learning
Deep
reinforcement
learning
for process design: Review and perspective
🔍
AI Interpretability
Content type:
Academic
arxiv.org
·
4d
4 days ago
Actions for Deep reinforcement learning for process design: Review and perspective
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help