Reinforcement Learning

Reinforcement Learning for Flow-Matching Policies with Density Transport

Optimization · Content type: Academic
arxiv.org

Cooperative Long Rope Skipping via Multi-Agent Reinforcement Learning

AI · Content type: Academic
arxiv.org

COP-Q: Safety-First Reinforcement Learning for Robot Control via Cholesky-Ordered Projection

Anthropic Claude · Content type: Academic
arxiv.org

Constrained Deep Reinforcement Learning for Cognitive Radar Resource Management

Deep Learning · Content type: Academic
arxiv.org

HARBOR: A Harness Framework for Agentic Robot Reinforcement Learning

Anthropic Claude · Content type: Academic
arxiv.org

Learning to Attack and Defend: Adaptive Red Teaming of Language Models via GRPO

Transformers · Content type: Academic
arxiv.org

Trace-Mediated Peak Bias: Bridging Temporal Credit Assignment and Cognitive Heuristics in Deep Reinforcement Learning

Optimization · Content type: Academic
arxiv.org

PRPO: Perception-Reinforced Policy Optimization via Token-Level Dynamic Advantage Reshaping

Optimization · Content type: Academic
arxiv.org

Learning to replenish: A hybrid deep reinforcement learning for dynamic inventory management in the pharmaceutical supply chains

Machine Learning · Content type: Academic
arxiv.org

Exact Unlearning in Reinforcement Learning

LLMs · Content type: Academic
arxiv.org

Fog of Love: Engineering Virtuous Agent Behavior with Affinity-based Reinforcement Learning in a Game Environment

Optimization · Content type: Academic
arxiv.org

Drag reduction or reward hacking? Recurrent multi-agent reinforcement learning that earns its reward

Cellular Automata · Content type: Academic
arxiv.org

Selective-Advantage Entropy-Adaptive Horizon GRPO: Asymmetric Token-Level Discounting for Efficient Reinforcement Learning of Language Models

Optimization · Content type: Academic
arxiv.org

Smart Transportation Without Neurons -- Fair Metro Network Expansion with Tabular Reinforcement Learning

Anthropic Claude · Content type: Academic
arxiv.org

Explainably Safe Reinforcement Learning

Prompt Engineering · Content type: Academic
arxiv.org

From Ticks to Flows: Dynamics of Neural Reinforcement Learning in Continuous Environments

Deep Learning · Content type: Academic
arxiv.org

GARL: Game-Theoretic Reinforcement Learning for Multi-Agent Strategic Prioritisation

Concurrency Models · Content type: Academic
arxiv.org

Self-Optimizing Control of Continuous Processes Based on Reinforcement Learning

Optimization · Content type: Academic
arxiv.org

Merging model-based control with multi-agent reinforcement learning for multi-agent cooperative teaming strategies

AI · Content type: Academic
arxiv.org

RUBAS: Rubric-Based Reinforcement Learning for Agent Safety

Cryptography · Content type: Academic
arxiv.org
