Reinforcement Learning

Feeds to Scour
SubscribedAll
Scoured 299 posts in 4.2 ms

Performance Variation in Deep Reinforcement Learning

 🔥PyTorch  Content type: Academic
arxiv.org·

Reinforcement Learning and Optimal Control Book (RIP Dimitri Bertsekas)

 📊Algorithms  Content type: Academic
web.mit.edu··Hacker News

SimarcLabs/pybullet-swarm-sim: Python framework for simulating drone swarms with PyBullet in seconds.

 🔍Code Review  Content type: Code
github.com··r/opensource

Good teachers don’t cheat

 📈Optimization  Content type: Blog

Deep reinforcement learning for process design: Review and perspective

 🧠Deep Learning  Content type: Academic
arxiv.org·

Towards End to End Motion Planning and Execution for Autonomous Underwater Vehicles Using Reinforcement Learning

 🧠Deep Learning  Content type: Academic
arxiv.org·

Self-Paced Curriculum Reinforcement Learning for Autonomous Superbike Racing in Simulation

 Code Generation  Content type: Academic
arxiv.org·

GIFT: LLM-Guided State-Reward Interface for Financial Reinforcement Learning

 💬Prompt Engineering  Content type: Academic
arxiv.org·

Policy Gradient for Continuous-Time Robust Markov Decision Processes

 📈Optimization  Content type: Academic
arxiv.org·

Path Planning Using Deep Deterministic Policy Gradient: A Reinforcement Learning Approach

 🔥PyTorch  Content type: Academic
arxiv.org·

Dynamic Multi-Pair Trading Strategy in Cryptocurrency Markets with Deep Reinforcement Learning

 🔥PyTorch  Content type: Academic
arxiv.org·

Uncertainty-Aware LLM-Guided Policy Shaping for Sparse-Reward Reinforcement Learning

 📝NLP  Content type: Academic
arxiv.org·

Progress-SQL: Improving Reinforcement Learning for Text-to-SQL via Progressive Rewards

 🐘PostgreSQL  Content type: Academic
arxiv.org·

Self-Distilled Policy Gradient

 📈Optimization  Content type: Academic
arxiv.org·

Reformulate LLM Reinforcement Learning for Efficient Training under Black-box Discrepancy

 🤖LLMs  Content type: Academic
arxiv.org·

Representation Learning Enables Scalable Multitask Deep Reinforcement Learning

 🧠Deep Learning  Content type: Academic
arxiv.org·

Belief-Space Quantum-Inspired Reinforcement Learning for Partially Observable Autonomous Cyber Defense in the Internet of Vehicles

 🔒Network Security  Content type: Academic
arxiv.org·

Enhancing the MADDPG Algorithm for Multi-Agent Learning via Action Inference and Importance Sampling

 🤖LLMs  Content type: Academic
arxiv.org·

QnRL: Quantum-Native Reinforcement Learning

 🦙Ollama  Content type: Academic
arxiv.org·

On Advantage Estimates for Max@K Policy Gradients

 📈Optimization  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help