Reinforcement Learning


China women’s volleyball team finish Nations League leg on a high after opening defeat

 🏙️Urban Planning  Content type: News
scmp.com · r/SCMPauto

2026 FIVB Volleyball Women's Nations League in Nanjing: Poland beats Czech Republic 3-0

 🏙️Urban Planning
ecns.cn

Spotlight On: Dreamplug Technologies Private Limited (CRED), a New Principal Participating Organization

 🧘Digital Minimalism  Content type: Blog

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

 📊Optimization  Content type: Academic
arxiv.org

BeatpulseLabs raises $1.8M pre-seed to scale AI training data

 🤖Machine learning  Content type: News
tech.eu

Protest against ballot paper shortages enters 2nd day, demanding new election

 🗺Maps  Content type: News
koreatimes.co.kr · r/news

Semi-finalists confirmed in Secondary Schools Volleyball Competition

 🔬Food Science
cbc.bb

Optimisation over non-stationary distributions creates weirder minds

 📊Optimization
lesswrong.com

Edge AI enabled MIMO MC-CDMA for 6G optimizing spectrum and energy efficiency with SIC and deep reinforcement learning

 📊Optimization  Content type: Academic
nature.com

What is MBPO? A Beginner’s Guide to Efficient Reinforcement Learning

 🤖Machine learning  Content type: Blog

Social Intelligence Arises Between Minds

 🔭Philosophy of Science
psychologytoday.com

Event-Driven Reinforcement Learning Enables Long-Horizon Control in Semiconductor Fabrication

 📊Optimization  Content type: Academic
arxiv.org

See, Act, Correct: three levers for working with a code agent

 📊Optimization  Content type: Blog

Central College News

 🔬Food Science  Content type: Academic
news.central.edu

Combermere and Harrison College reach Under-15 basketball final

 🔬Food Science
cbc.bb

Development of COVID-19 Booster Vaccine Policy by Microsimulation and Q-learning

 📊Statistical Computing  Content type: Academic
arxiv.org

Bridging Multi-Vector and Learned-Sparse Retrieval, A Diagnostic Framework for Robust Semantic IDs, and More!

 🤖Machine learning  Content type: News, Blog

Sasha Rush explains targeted on-policy self-distillation, a reinforcement learning technique that corrects specific LLM rollout errors

 🤖Machine learning
digg.com

Geometry-Aware Reinforcement Learning for 2D Irregular Nesting

 📊Optimization  Content type: Academic
arxiv.org

NVIDIA Nemotron 3 Ultra Powers Faster, More Efficient Reasoning for Long-Running Agents

 🤖Machine learning  Content type: Blog
