Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
RL
🎮 RL
Specific
reinforcement learning, reward modeling, policy gradient
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
178
posts in
18.9
ms
Merging
model-based
control with
multi-agent
reinforcement learning for
multi-agent
cooperative teaming strategies
🌐
World Models
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Merging model-based control with multi-agent reinforcement learning for multi-agent cooperative teaming strategies
Researchers develop AI-powered railway control system for efficient urban train operation
🌐
World Models
techxplore.com
·
1d
1 day ago
Actions for Researchers develop AI-powered railway control system for efficient urban train operation
I Got Tired of Rebuilding My Retro
RL
Projects
🎯
Post-training
Content type:
Blog
medium.com
·
5h
5 hours ago
Actions for I Got Tired of Rebuilding My Retro RL Projects
Q-Learning
(
Reinforcement
learning
): Bellman Equation, Markov Decision Processes, Q-Values, and…
🌐
World Models
Content type:
Blog
medium.com
·
2d
2 days ago
Actions for Q-Learning (Reinforcement learning): Bellman Equation, Markov Decision Processes, Q-Values, and…
Reasoning
RL
in 2026: GRPO, DPO, RLVR,
Agentic
PO
& Beyond
🎯
Post-training
turingpost.com
·
4d
4 days ago
Actions for Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
Edge AI enabled MIMO MC-CDMA for 6G optimizing spectrum and energy efficiency with SIC and
deep
reinforcement
learning
🌐
World Models
Content type:
Academic
nature.com
·
1d
1 day ago
Actions for Edge AI enabled MIMO MC-CDMA for 6G optimizing spectrum and energy efficiency with SIC and deep reinforcement learning
SimarcLabs/pybullet-swarm-sim: Python framework for simulating drone swarms with PyBullet in seconds.
🌐
World Models
Content type:
Code
github.com
·
3d
3 days ago
·
r/opensource
Actions for SimarcLabs/pybullet-swarm-sim: Python framework for simulating drone swarms with PyBullet in seconds.
Less-relevant results
Some Interesting Papers on RLVR
🎯
Post-training
lesswrong.com
·
1d
1 day ago
Actions for Some Interesting Papers on RLVR
Reinforcement
Learning
and Optimal Control Book (RIP Dimitri Bertsekas)
🌐
World Models
Content type:
Academic
web.mit.edu
·
6d
6 days ago
·
Hacker News
Actions for Reinforcement Learning and Optimal Control Book (RIP Dimitri Bertsekas)
China women’s volleyball team finish Nations League leg on a high after opening defeat
🌐
World Models
Content type:
News
scmp.com
·
2d
2 days ago
·
r/SCMPauto
Actions for China women’s volleyball team finish Nations League leg on a high after opening defeat
2026 FIVB Volleyball Women's Nations League in Nanjing: Poland beats Czech Republic 3-0
🌐
World Models
ecns.cn
·
6d
6 days ago
Actions for 2026 FIVB Volleyball Women's Nations League in Nanjing: Poland beats Czech Republic 3-0
Scale Robot
Reinforcement
Learning
with NVIDIA Isaac Lab on Amazon SageMaker AI
🌐
World Models
Content type:
Blog
aws.amazon.com
·
1d
1 day ago
Actions for Scale Robot Reinforcement Learning with NVIDIA Isaac Lab on Amazon SageMaker AI
Deterministic
Policy
Gradient
for
Learning
Equilibrium in Time-Inconsistent Control Problems
🌐
World Models
Content type:
Academic
arxiv.org
·
10h
10 hours ago
Actions for Deterministic Policy Gradient for Learning Equilibrium in Time-Inconsistent Control Problems
Protest against ballot paper shortages enters 2nd day, demanding new election
💬
LLMs
Content type:
News
koreatimes.co.kr
·
5d
5 days ago
·
r/news
Actions for Protest against ballot paper shortages enters 2nd day, demanding new election
Semi-finalists confirmed in Secondary Schools Volleyball Competition
💬
LLMs
cbc.bb
·
1d
1 day ago
Actions for Semi-finalists confirmed in Secondary Schools Volleyball Competition
Photos: Syracuse Views Through the Decades
🌐
World Models
Content type:
Academic
news.syr.edu
·
2d
2 days ago
Actions for Photos: Syracuse Views Through the Decades
Phi-Actor-Critic
: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria
🤖
AI Agents
Content type:
Academic
arxiv.org
·
10h
10 hours ago
Actions for Phi-Actor-Critic: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria
Central College News
📈
Economics
Content type:
Academic
news.central.edu
·
4d
4 days ago
Actions for Central College News
Why LLMs (still) lack taste
🎯
Post-training
beyondtheprior.com
·
2d
2 days ago
·
Hacker News
Actions for Why LLMs (still) lack taste
Hrithik Roshan Signs With Anonymous Content
🏋️
Pretraining
Content type:
News
deadline.com
·
22h
22 hours ago
Actions for Hrithik Roshan Signs With Anonymous Content
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help