Reinforcement Learning

I built a machine that turns AI papers into interactive explainers

 🤖AI Research  Content type: Blog
blog.skz.dev

Space-sampled Value Decay: Forgetting Mechanisms for Non-stationary Deep Reinforcement Learning

 🤖AI Research  Content type: Academic
arxiv.org

‘I don’t want my children to grow up in a broken family’: Abused husbands in S’pore who are unseen

 💬NLP

Breaking free of a single datacenter: Practical geo-distributed AI operations with the k0smos platforms

 🌐Distributed Systems  Content type: Blog
cncf.io

Reinforcement Learning Disrupts Gradient-Based Adversarial Optimization

 🤖AI Research  Content type: Academic
arxiv.org

U.S. Dental Insurance Market Growth, Coverage Trends and Industry Forecast

 Cryptocurrency
community.ops.io

Geometrically Averaged Hard Target Updates for Linear Q-Learning

 📊Quantitative Finance  Content type: Academic
arxiv.org

Less-relevant results

A Human-Augmenting Agentic Workflow for Causal Inference

 🤖AI Research  Content type: Blog

San Francisco Construction Security Company: Complete Guide to Protecting Your Job Site in 2026

 High-Frequency Trading  Content type: Blog
medium.com

Fast and Highly Expressive Policy Learning for Offline Reinforcement Learning via Bootstrapped Flow Q-Learning

 🤖AI Research  Content type: Academic
arxiv.org

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

 📊Quantitative Finance  Content type: Academic
arxiv.org

TT-DAC-PS: Twin-Target Deterministic Actor-Critic with Policy Smoothing for Optimal Trade Execution

 📈Trading  Content type: Academic
arxiv.org

Variational Proximal Policy Optimization

 🤖AI Research  Content type: Academic
arxiv.org

Dmsh: A Multi-Agent Reinforcement Learning Framework for All-Quad Mesh Generation

 🤖AI Research  Content type: Academic
arxiv.org

UNIQ: Conformal Calibration for Adaptive Conservatism in Offline Reinforcement Learning

 📊Quantitative Finance  Content type: Academic
arxiv.org

Hey Chat, Can You Teach Me? Structuring Socratic Dialogue for Human Learning in the Wild

 💬NLP  Content type: Academic
arxiv.org

Belief-Space Quantum-Inspired Reinforcement Learning for Partially Observable Autonomous Cyber Defense in the Internet of Vehicles

 📊Quantitative Finance  Content type: Academic
arxiv.org

Seeing Before Colliding: Anticipatory Safe RL with Frozen Vision-Language Models

 💬NLP  Content type: Academic
arxiv.org

Development of COVID-19 Booster Vaccine Policy by Microsimulation and Q-learning

 💬NLP  Content type: Academic
arxiv.org

Merging model-based control with multi-agent reinforcement learning for multi-agent cooperative teaming strategies

 🤖AI Research  Content type: Academic
arxiv.org
