Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Reinforcement Learning
🎮 Reinforcement Learning
RLHF, Reward Models, Policy, Agents
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
246
posts in
16.0
ms
Towards End to End Motion Planning and Execution for Autonomous Underwater Vehicles Using
Reinforcement
Learning
🤖
Robotics
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Towards End to End Motion Planning and Execution for Autonomous Underwater Vehicles Using Reinforcement Learning
Reinforcement
Learning
and Optimal Control Book (RIP Dimitri Bertsekas)
📐
Math
Content type:
Academic
web.mit.edu
·
5d
5 days ago
·
Hacker News
Actions for Reinforcement Learning and Optimal Control Book (RIP Dimitri Bertsekas)
Agents
Need Work Data: A Primer on RLWD, or
Reinforcement
Learning
on Work Data
🕵️
LLM Agents
anjalishriva.com
·
1d
1 day ago
·
Hacker News
Actions for Agents Need Work Data: A Primer on RLWD, or Reinforcement Learning on Work Data
See,
Act
, Correct: three levers for working with a code
agent
🤖
AI
Content type:
Blog
blog.owulveryck.info
·
6d
6 days ago
·
Hacker News
,
Hacker News
Actions for See, Act, Correct: three levers for working with a code agent
Scale Robot
Reinforcement
Learning
with NVIDIA Isaac Lab on Amazon SageMaker AI
🧠
Machine Learning
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Actions for Scale Robot Reinforcement Learning with NVIDIA Isaac Lab on Amazon SageMaker AI
NVIDIA Nemotron 3 Ultra Powers Faster, More Efficient Reasoning for Long-Running
Agents
🕵️
LLM Agents
Content type:
Blog
developer.nvidia.com
·
6d
6 days ago
·
Hacker News
Actions for NVIDIA Nemotron 3 Ultra Powers Faster, More Efficient Reasoning for Long-Running Agents
Fast and Highly Expressive
Policy
Learning
for Offline
Reinforcement
Learning
via Bootstrapped Flow
Q-Learning
🔥
PyTorch
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Fast and Highly Expressive Policy Learning for Offline Reinforcement Learning via Bootstrapped Flow Q-Learning
Flow-DPPO: Divergence Proximal
Policy
Optimization for Flow Matching
Models
📐
Optimization Theory
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
Discovering Interpretable Multi-Parameter Control
Policies
for Evolutionary Algorithms Using
Deep
Reinforcement
Learning
🤖
AI
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Discovering Interpretable Multi-Parameter Control Policies for Evolutionary Algorithms Using Deep Reinforcement Learning
Development of COVID-19 Booster Vaccine
Policy
by Microsimulation and
Q-learning
📐
Semidefinite Programming
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Development of COVID-19 Booster Vaccine Policy by Microsimulation and Q-learning
Performance Variation in
Deep
Reinforcement
Learning
🧠
LLM
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Performance Variation in Deep Reinforcement Learning
Event-Driven
Reinforcement
Learning
Enables Long-Horizon Control in Semiconductor Fabrication
📐
Optimization Theory
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Event-Driven Reinforcement Learning Enables Long-Horizon Control in Semiconductor Fabrication
A Unifying Lens on
Reward
Uncertainty in
RLHF
🤖
AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for A Unifying Lens on Reward Uncertainty in RLHF
Test-Time Gradient Guidance of Flow
Policies
in
Reinforcement
Learning
🤖
Robotics
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning
Merging
model-based
control with
multi-agent
reinforcement
learning for
multi-agent
cooperative teaming strategies
🕵️
LLM Agents
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for Merging model-based control with multi-agent reinforcement learning for multi-agent cooperative teaming strategies
Deep
reinforcement
learning
for process design: Review and perspective
🕵️
LLM Agents
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Deep reinforcement learning for process design: Review and perspective
Geometrically Averaged Hard Target Updates for Linear
Q-Learning
📐
Optimization Theory
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Geometrically Averaged Hard Target Updates for Linear Q-Learning
Self-Paced Curriculum
Reinforcement
Learning
for Autonomous Superbike Racing in Simulation
🎛️
Control Systems
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Self-Paced Curriculum Reinforcement Learning for Autonomous Superbike Racing in Simulation
Dmsh: A
Multi-Agent
Reinforcement
Learning
Framework for All-Quad Mesh Generation
📐
Semidefinite Programming
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Dmsh: A Multi-Agent Reinforcement Learning Framework for All-Quad Mesh Generation
GARL: Game-Theoretic
Reinforcement
Learning
for
Multi-Agent
Strategic Prioritisation
🕵️
LLM Agents
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for GARL: Game-Theoretic Reinforcement Learning for Multi-Agent Strategic Prioritisation
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help