Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Reinforcement Learning
🎮 Reinforcement Learning
Q-Learning, Policy Gradients, OpenAI Gym, Reward Functions
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
425
posts in
58.4
ms
China women’s volleyball team finish Nations League leg on a high after opening defeat
🏙️
Urban Planning
Content type:
News
scmp.com
·
2d
2 days ago
·
r/SCMPauto
Actions for China women’s volleyball team finish Nations League leg on a high after opening defeat
2026 FIVB Volleyball Women's Nations League in Nanjing: Poland beats Czech Republic 3-0
🏙️
Urban Planning
ecns.cn
·
5d
5 days ago
Actions for 2026 FIVB Volleyball Women's Nations League in Nanjing: Poland beats Czech Republic 3-0
Spotlight On: Dreamplug Technologies Private Limited (CRED), a New Principal Participating Organization
🧘
Digital Minimalism
Content type:
Blog
blog.pcisecuritystandards.org
·
2d
2 days ago
Actions for Spotlight On: Dreamplug Technologies Private Limited (CRED), a New Principal Participating Organization
Flow-DPPO: Divergence
Proximal
Policy
Optimization
for Flow Matching Models
📊
Optimization
Content type:
Academic
arxiv.org
·
20h
20 hours ago
Actions for Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
BeatpulseLabs raises $1.8M pre-seed to scale AI training data
🤖
Machine learning
Content type:
News
tech.eu
·
2d
2 days ago
Actions for BeatpulseLabs raises $1.8M pre-seed to scale AI training data
Protest against ballot paper shortages enters 2nd day, demanding new election
🗺
Maps
Content type:
News
koreatimes.co.kr
·
4d
4 days ago
·
r/news
Actions for Protest against ballot paper shortages enters 2nd day, demanding new election
Semi-finalists confirmed in Secondary Schools Volleyball Competition
🔬
Food Science
cbc.bb
·
1d
1 day ago
Actions for Semi-finalists confirmed in Secondary Schools Volleyball Competition
Optimisation over non-stationary distributions creates weirder minds
📊
Optimization
lesswrong.com
·
5d
5 days ago
Actions for Optimisation over non-stationary distributions creates weirder minds
Edge AI enabled MIMO MC-CDMA for 6G
optimizing
spectrum and energy efficiency with SIC and
deep
reinforcement
learning
📊
Optimization
Content type:
Academic
nature.com
·
1d
1 day ago
Actions for Edge AI enabled MIMO MC-CDMA for 6G optimizing spectrum and energy efficiency with SIC and deep reinforcement learning
What is MBPO? A Beginner’s Guide to Efficient
Reinforcement
Learning
🤖
Machine learning
Content type:
Blog
ujangriswanto08.medium.com
·
5d
5 days ago
Actions for What is MBPO? A Beginner’s Guide to Efficient Reinforcement Learning
Social intelligence Arises Between Minds
🔭
Philosophy of Science
psychologytoday.com
·
3d
3 days ago
Actions for Social intelligence Arises Between Minds
Event-Driven
Reinforcement
Learning
Enables Long-Horizon Control in Semiconductor Fabrication
📊
Optimization
Content type:
Academic
arxiv.org
·
20h
20 hours ago
Actions for Event-Driven Reinforcement Learning Enables Long-Horizon Control in Semiconductor Fabrication
See, Act, Correct: three levers for working with a code agent
📊
Optimization
Content type:
Blog
blog.owulveryck.info
·
6d
6 days ago
·
Hacker News
,
Hacker News
Actions for See, Act, Correct: three levers for working with a code agent
Central College News
🔬
Food Science
Content type:
Academic
news.central.edu
·
3d
3 days ago
Actions for Central College News
Combermere and Harrison College reach Under-15 basketball final
🔬
Food Science
cbc.bb
·
4d
4 days ago
Actions for Combermere and Harrison College reach Under-15 basketball final
Development of COVID-19 Booster Vaccine
Policy
by Microsimulation and
Q-learning
📊
Statistical Computing
Content type:
Academic
arxiv.org
·
20h
20 hours ago
Actions for Development of COVID-19 Booster Vaccine Policy by Microsimulation and Q-learning
Bridging Multi-Vector and
Learned-Sparse
Retrieval, A Diagnostic Framework for Robust Semantic IDs, and More!
🤖
Machine learning
Content type:
News
Content type:
Blog
recsys.substack.com
·
5d
5 days ago
·
Substack
Actions for Bridging Multi-Vector and Learned-Sparse Retrieval, A Diagnostic Framework for Robust Semantic IDs, and More!
Sasha Rush explains targeted
on-policy
self-distillation, a
reinforcement
learning
technique that corrects specific LLM rollout errors
🤖
Machine learning
digg.com
·
6d
6 days ago
Actions for Sasha Rush explains targeted on-policy self-distillation, a reinforcement learning technique that corrects specific LLM rollout errors
Geometry-Aware
Reinforcement
Learning
for 2D Irregular Nesting
📊
Optimization
Content type:
Academic
arxiv.org
·
20h
20 hours ago
Actions for Geometry-Aware Reinforcement Learning for 2D Irregular Nesting
NVIDIA Nemotron 3 Ultra Powers Faster, More Efficient Reasoning for Long-Running Agents
🤖
Machine learning
Content type:
Blog
developer.nvidia.com
·
6d
6 days ago
·
Hacker News
Actions for NVIDIA Nemotron 3 Ultra Powers Faster, More Efficient Reasoning for Long-Running Agents
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help