🎯 Reinforcement Learning
Q-Learning, Policy Gradients, Game Theory, Decision Making
GIFT: LLM-Guided State-Reward Interface for Financial Reinforcement Learning
📊 Quantitative Finance · Academic · arxiv.org · 2 days ago
Structure-Conditioned Actor-Critic Branches for Quality-Diversity Reinforcement Learning
AI Research · Academic · arxiv.org · 2 days ago
Path Planning Using Deep Deterministic Policy Gradient: A Reinforcement Learning Approach
AI Research · Academic · arxiv.org · 2 days ago
On Advantage Estimates for Max@K Policy Gradients
AI Research · Academic · arxiv.org · 6 days ago
Towards End to End Motion Planning and Execution for Autonomous Underwater Vehicles Using Reinforcement Learning
AI Research · Academic · arxiv.org · 2 days ago
Discovering Interpretable Multi-Parameter Control Policies for Evolutionary Algorithms Using Deep Reinforcement Learning
AI Research · Academic · arxiv.org · 1 day ago
Performance Variation in Deep Reinforcement Learning
AI Research · Academic · arxiv.org · 3 days ago
Learning Predictive Control with Deep Koopman Operators for Autonomous Vehicle Motion Planning
AI Research · Academic · arxiv.org · 2 days ago
Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning
💬 NLP · Academic · arxiv.org · 1 day ago
Retry Policy Gradients in Continuous Action Spaces
📊 Quantitative Finance · Academic · arxiv.org · 6 days ago
Failure Modes of Deep Multi-Agent RL in Asynchronous Pricing: Reproducible Triggers, Trace Diagnostics, and a Partial Fix
📊 Quantitative Finance · Academic · arxiv.org · 1 day ago
Learning to replenish: A hybrid deep reinforcement learning for dynamic inventory management in the pharmaceutical supply chains
📊 Quantitative Finance · Academic · arxiv.org · 6 days ago
Mitigating Bias in Low-SNR Financial Reinforcement Learning via Quantum Representations
📊 Quantitative Finance · Academic · arxiv.org · 1 day ago
Uncertainty-Aware LLM-Guided Policy Shaping for Sparse-Reward Reinforcement Learning
💬 NLP · Academic · arxiv.org · 3 days ago
A Barrier-Modulated Architecture for Safe Affine Formation Control in Second-Order Multi-Agent Systems
🌐 Distributed Systems · Academic · arxiv.org · 2 days ago
Drag reduction or reward hacking? Recurrent multi-agent reinforcement learning that earns its reward
AI Research · Academic · arxiv.org · 6 days ago
Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning
AI Research · Academic · arxiv.org · 1 day ago
Quantum-Inspired Reinforcement Learning for Low-Latency Intrusion Detection in V2X and Internet-of-Vehicles Networks
📊 Quantitative Finance · Academic · arxiv.org · 2 days ago
Selective-Advantage Entropy-Adaptive Horizon GRPO: Asymmetric Token-Level Discounting for Efficient Reinforcement Learning of Language Models
💬 NLP · Academic · arxiv.org · 6 days ago
Self-Paced Curriculum Reinforcement Learning for Autonomous Superbike Racing in Simulation
AI Research · Academic · arxiv.org · 2 days ago