Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Reinforcement Learning
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
430
posts in
6.7
ms
Performance Variation in Deep
Reinforcement
Learning
🗣️
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Performance Variation in Deep Reinforcement Learning
Dmsh: A Multi-Agent
Reinforcement
Learning
Framework for All-Quad Mesh Generation
💬
Prompt Engineering
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Dmsh: A Multi-Agent Reinforcement Learning Framework for All-Quad Mesh Generation
TT-DAC-PS: Twin-Target Deterministic
Actor-Critic
with
Policy
Smoothing for Optimal Trade Execution
🤖
AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for TT-DAC-PS: Twin-Target Deterministic Actor-Critic with Policy Smoothing for Optimal Trade Execution
Discovering Interpretable Multi-Parameter Control
Policies
for Evolutionary Algorithms Using Deep
Reinforcement
Learning
💬
Prompt Engineering
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Discovering Interpretable Multi-Parameter Control Policies for Evolutionary Algorithms Using Deep Reinforcement Learning
Structure-Conditioned
Actor-Critic
Branches for Quality-Diversity
Reinforcement
Learning
💬
Prompt Engineering
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Structure-Conditioned Actor-Critic Branches for Quality-Diversity Reinforcement Learning
Geometry-Aware
Reinforcement
Learning
for 2D Irregular Nesting
💬
Prompt Engineering
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Geometry-Aware Reinforcement Learning for 2D Irregular Nesting
UNIQ: Conformal Calibration for Adaptive Conservatism in Offline
Reinforcement
Learning
💬
Prompt Engineering
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for UNIQ: Conformal Calibration for Adaptive Conservatism in Offline Reinforcement Learning
SocraticPO:
Policy
Optimization via Interactive Guidance
🗣️
LLMs
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for SocraticPO: Policy Optimization via Interactive Guidance
Offline
Reinforcement
Learning
for Plasma Control in Nuclear Fusion: Codebase and Benchmark
💬
Prompt Engineering
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Offline Reinforcement Learning for Plasma Control in Nuclear Fusion: Codebase and Benchmark
On-sky demonstration of
reinforcement
learning
for adaptive optics control
💬
Prompt Engineering
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for On-sky demonstration of reinforcement learning for adaptive optics control
Policy
Gradient
for Continuous-Time Robust
Markov
Decision Processes
💬
Prompt Engineering
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Policy Gradient for Continuous-Time Robust Markov Decision Processes
Uncertainty-Aware LLM-Guided
Policy
Shaping for
Sparse-Reward
Reinforcement
Learning
🗣️
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Uncertainty-Aware LLM-Guided Policy Shaping for Sparse-Reward Reinforcement Learning
Beyond Uniform Token-Level Trust Region in LLM
Reinforcement
Learning
🗣️
LLMs
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning
Towards End to End Motion Planning and Execution for Autonomous Underwater Vehicles Using
Reinforcement
Learning
💬
Prompt Engineering
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Towards End to End Motion Planning and Execution for Autonomous Underwater Vehicles Using Reinforcement Learning
RoboNaldo: Accurate, Stable and Powerful Humanoid Soccer Shooting via Motion-Guided Curriculum
Reinforcement
Learning
🤖
AI
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for RoboNaldo: Accurate, Stable and Powerful Humanoid Soccer Shooting via Motion-Guided Curriculum Reinforcement Learning
Representation
Learning
Enables Scalable Multitask Deep
Reinforcement
Learning
💬
Prompt Engineering
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for Representation Learning Enables Scalable Multitask Deep Reinforcement Learning
Reinforcement
Learning
for Flow-Matching
Policies
with Density Transport
🗣️
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Reinforcement Learning for Flow-Matching Policies with Density Transport
Event-Driven
Reinforcement
Learning
Enables Long-Horizon Control in Semiconductor Fabrication
💬
Prompt Engineering
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Event-Driven Reinforcement Learning Enables Long-Horizon Control in Semiconductor Fabrication
Rethinking the Divergence Regularization in LLM
RL
🗣️
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Rethinking the Divergence Regularization in LLM RL
How Does Reasoning Flow? Tracing Attention-Induced Information Flow for Targeted
RL
in LLMs
🗣️
LLMs
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for How Does Reasoning Flow? Tracing Attention-Induced Information Flow for Targeted RL in LLMs
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help