Reinforcement Learning

Representation Learning Enables Scalable Multitask Deep Reinforcement Learning

Continual Learning | Content type: Academic
arxiv.org

An Agency-Transferring Model-Free Policy Enhancement Technique

Active Inference | Content type: Academic
arxiv.org

QnRL: Quantum-Native Reinforcement Learning

Active Inference | Content type: Academic
arxiv.org

Learning to replenish: A hybrid deep reinforcement learning for dynamic inventory management in the pharmaceutical supply chains

Evolutionary Computation | Content type: Academic
arxiv.org

UNIQ: Conformal Calibration for Adaptive Conservatism in Offline Reinforcement Learning

Active Inference | Content type: Academic
arxiv.org

On Advantage Estimates for Max@K Policy Gradients

Active Inference | Content type: Academic
arxiv.org

Learning Predictive Control with Deep Koopman Operators for Autonomous Vehicle Motion Planning

Computational Mechanics | Content type: Academic
arxiv.org

Offline Reinforcement Learning for Plasma Control in Nuclear Fusion: Codebase and Benchmark

Continual Learning | Content type: Academic
arxiv.org

GARL: Game-Theoretic Reinforcement Learning for Multi-Agent Strategic Prioritisation

Collective Intelligence | Content type: Academic
arxiv.org

Cooperative Long Rope Skipping via Multi-Agent Reinforcement Learning

Collective Intelligence | Content type: Academic
arxiv.org

Belief-Space Quantum-Inspired Reinforcement Learning for Partially Observable Autonomous Cyber Defense in the Internet of Vehicles

Neuromorphic Computing | Content type: Academic
arxiv.org

Fog of Love: Engineering Virtuous Agent Behavior with Affinity-based Reinforcement Learning in a Game Environment

Collective Intelligence | Content type: Academic
arxiv.org

Reformulate LLM Reinforcement Learning for Efficient Training under Black-box Discrepancy

Active Inference | Content type: Academic
arxiv.org

Policy-Conditioned Counterfactual Credit for Verifiable Reinforcement Learning of Long-Horizon Language Agents

Active Inference | Content type: Academic
arxiv.org

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

Active Inference | Content type: Academic
arxiv.org

COP-Q: Safety-First Reinforcement Learning for Robot Control via Cholesky-Ordered Projection

Developmental Robotics | Content type: Academic
arxiv.org

RUBAS: Rubric-Based Reinforcement Learning for Agent Safety

Computational Mechanics | Content type: Academic
arxiv.org

Reinforcement Learning from Rich Feedback with Distributional DAgger

Active Inference | Content type: Academic
arxiv.org

BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization

Evolutionary Computation | Content type: Academic
arxiv.org

Retry Policy Gradients in Continuous Action Spaces

Active Inference | Content type: Academic
arxiv.org