Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Reinforcement Learning
🎮 Reinforcement Learning
RL, Rewards, Agents, Policy, Q-learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
395
posts in
5.6
ms
OrderGrad: Optimizing Beyond the Mean with Order-Statistic
Policy
Gradient
Estimation
🎯
RLHF
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for OrderGrad: Optimizing Beyond the Mean with Order-Statistic Policy Gradient Estimation
COP-Q: Safety-First
Reinforcement
Learning
for Robot Control via Cholesky-Ordered Projection
🎯
RLHF
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for COP-Q: Safety-First Reinforcement Learning for Robot Control via Cholesky-Ordered Projection
Selective-Advantage Entropy-Adaptive Horizon GRPO: Asymmetric Token-Level Discounting for Efficient
Reinforcement
Learning
of Language Models
🎛️
Fine-tuning
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for Selective-Advantage Entropy-Adaptive Horizon GRPO: Asymmetric Token-Level Discounting for Efficient Reinforcement Learning of Language Models
Neetyabhas: A Framework for Uncertainty-Aware Public
Policy
Optimization in Rational
Agent-Based
Models
📊
Bayesian Statistics
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Neetyabhas: A Framework for Uncertainty-Aware Public Policy Optimization in Rational Agent-Based Models
Reproducing, Analyzing, and Detecting
Reward
Hacking in Rubric-Based
Reinforcement
Learning
🎯
RLHF
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning
AgentJet
: A Flexible Swarm Training Framework for Agentic
Reinforcement
Learning
🎯
AI Agents
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for AgentJet: A Flexible Swarm Training Framework for Agentic Reinforcement Learning
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help