🎮 Reinforcement Learning
RL, reward functions, policy gradient, agents, simulation
Preserving Disagreement: Architectural Heterogeneity and Coherence Validation in Multi-Agent Policy Simulation
🧠 AI Agents · arxiv.org · 23h
SpecRLBench: A Benchmark for Generalization in Specification-Guided Reinforcement Learning
🧠 LLMs · arxiv.org · 2d
Lifting Embodied World Models for Planning and Control
🧠 AI Agents · arxiv.org · 23h
From Coarse to Fine: Self-Adaptive Hierarchical Planning for LLM Agents
🧠 AI Agents · arxiv.org · 2d
Split over $n$ resource sharing problem: Are fewer capable agents better than many simpler ones?
🕸️ Distributed Systems · arxiv.org · 23h
CoFi-PGMA: Counterfactual Policy Gradients under Filtered Feedback for Multi-Agent LLMs
🧠 LLMs · arxiv.org · 2d
reward-lens: A Mechanistic Interpretability Library for Reward Models
🧠 LLMs · arxiv.org · 23h
Frictive Policy Optimization for LLMs: Epistemic Intervention, Risk-Sensitive Control, and Reflective Alignment
🧠 LLMs · arxiv.org · 1d
NeuroPlastic: A Plasticity-Modulated Optimizer for Biologically Inspired Learning Dynamics
🧠 LLMs · arxiv.org · 23h
Perfecting Aircraft Maneuvers with Reinforcement Learning
🚗 Autonomous Systems · arxiv.org · 2d
Co-Learning Port-Hamiltonian Systems and Optimal Energy-Shaping Control
🚗 Autonomous Systems · arxiv.org · 23h
TCOD: Exploring Temporal Curriculum in On-Policy Distillation for Multi-turn Autonomous Agents
🧠 LLMs · arxiv.org · 2d
3D Generation for Embodied AI and Robotic Simulation: A Survey
🤖 Robotics · arxiv.org · 23h
SOLAR-RL: Semi-Online Long-horizon Assignment Reinforcement Learning
🧠 AI Agents · arxiv.org · 3d
BitRL: Reinforcement Learning with 1-bit Quantized Language Models for Resource-Constrained Edge Deployment
⚙️ MLOps · arxiv.org · 2d
CODA: Coordination via On-Policy Diffusion for Multi-Agent Offline Reinforcement Learning
🕸️ Distributed Systems · arxiv.org · 2d
AEL: Agent Evolving Learning for Open-Ended Environments
🧠 AI Agents · arxiv.org · 6d
CAPSULE: Control-Theoretic Action Perturbations for Safe Uncertainty-Aware Reinforcement Learning
🚗 Autonomous Systems · arxiv.org · 2d
Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own
🧠 AI Agents · arxiv.org · 6d
Agent-Centric Visual Reinforcement Learning under Dynamic Perturbations
🧠 AI Agents · arxiv.org · 2d