Reinforcement Learning

Reinforcement Learning Disrupts Gradient-Based Adversarial Optimization

 🤖Machine Learning  Content type: Academic
arxiv.org·

Stubborn: A Streamlined and Unified Reinforcement Learning Framework for Robust Motion Tracking and Fall Recovery for Humanoids

 Incremental Computation  Content type: Academic
arxiv.org·

Geometrically Averaged Hard Target Updates for Linear Q-Learning

 Incremental Computation  Content type: Academic
arxiv.org·

Improving Generalization and Data Efficiency with Diffusion in Offline Multi-agent RL

 🌍Distributed Systems  Content type: Academic
arxiv.org·

Discovering Interpretable Multi-Parameter Control Policies for Evolutionary Algorithms Using Deep Reinforcement Learning

 🤖Machine Learning  Content type: Academic
arxiv.org·

UniIntervene: Agentic Intervention for Efficient Real-World Reinforcement Learning

 Incremental Computation  Content type: Academic
arxiv.org·

SHAPO: Sharpness-Aware Policy Optimization for Safe Exploration

 🔍AI Interpretability  Content type: Academic
arxiv.org·

KinematicRL: A Sim-to-Real Reinforcement Learning Framework For Social Navigation With Kinodynamic Feasibility

 Incremental Computation  Content type: Academic
arxiv.org·

Demystifying Hidden-State Recurrence: Switchable Latent Reasoning with On-Policy Reinforcement Learning

 λFunctional Programming  Content type: Academic
arxiv.org · Hacker News

Performance Variation in Deep Reinforcement Learning

 🤖Machine Learning  Content type: Academic
arxiv.org·

Keep Policy Gradient in Charge: Sibling-Guided Credit Distillation for Long-Horizon Tool-Use Agents

 Incremental Computation  Content type: Academic
arxiv.org·

Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning

 🔍AI Interpretability  Content type: Academic

Phi-Actor-Critic: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria

 Incremental Computation  Content type: Academic
arxiv.org·

Learning to Adapt: Representation-Based Reinforcement Learning for Multi-Task Skill Transfer

 Incremental Computation  Content type: Academic
arxiv.org·

Reasoning or Memorization? Direction-Aware Diversity Exploration in LLM Reinforcement Learning

 🔍AI Interpretability  Content type: Academic
arxiv.org·

Redesigning Regularization for Effective Policy Smoothing

 🔍AI Interpretability  Content type: Academic
arxiv.org·

Reinforcement Learning for Flow-Matching Policies with Density Transport

 🤖Machine Learning  Content type: Academic
arxiv.org·

Critic Architecture Matters: Dual vs. Unified Critics for Humanoid Loco-Manipulation

 Incremental Computation  Content type: Academic
arxiv.org·

PAWS: Preference Learning with Advantage-Weighted Segments

 🔍AI Interpretability  Content type: Academic
arxiv.org·

Event-Driven Reinforcement Learning Enables Long-Horizon Control in Semiconductor Fabrication

 🌀Complexity Science  Content type: Academic
arxiv.org·
