Reinforcement Learning

Development of COVID-19 Booster Vaccine Policy by Microsimulation and Q-learning

 🤖ML  Content type: Academic
arxiv.org·

Breaking free of a single datacenter: Practical geo-distributed AI operations with the k0smos platforms

 🤖AI  Content type: Blog
cncf.io·

I got so mad at poke(rogue)like that I trained a RL agent to beat it for me

 🤖ML

U.S. Dental Insurance Market Growth, Coverage Trends and Industry Forecast

 🔧Feature Engineering
community.ops.io·

Sequent: scale and automation for higher confidence in alignment

 🤖AI
lesswrong.com·

Test Your Skills Against an AI Air Hockey Robot

 🤖ML  Content type: News
hackster.io·

Understanding your paycheck in Workday

 📈Time Series  Content type: Academic
news.clemson.edu·

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

 🤖AI  Content type: Academic
arxiv.org·

I built a machine that turns AI papers into interactive explainers

 🤖AI  Content type: Blog
blog.skz.dev·

A Human-Augmenting Agentic Workflow for Causal Inference

 🤖AI  Content type: Blog

Dmsh: A Multi-Agent Reinforcement Learning Framework for All-Quad Mesh Generation

 🌐Distributed Systems  Content type: Academic
arxiv.org·

‘I don’t want my children to grow up in a broken family’: Abused husbands in S’pore who are unseen

 🤖AI
Less-relevant results

San Francisco Construction Security Company: Complete Guide to Protecting Your Job Site in 2026

 📈Time Series  Content type: Blog
medium.com·

Linux Falls Hard on Steam After Record 5% Milestone

 📈Time Series
linuxiac.com·

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

 🤖AI  Content type: Academic
arxiv.org·

SLUUG Talk: Demystifying Large Language Models on Linux

 🤖AI  Content type: Code
github.com··DEV

Representation-Aware Advantage Estimation: Your Reward Model Provides More Than A Scalar Output

 🤖AI  Content type: Academic
arxiv.org·

Mitigating Bias in Low-SNR Financial Reinforcement Learning via Quantum Representations

 🤖AI  Content type: Academic
arxiv.org·

(VERY PARTIAL) CROSSPOST: ALEX HEATH: SubStack Is Opening Up to AI: Interviewing CEO Chris Best

 🤖AI  Content type: News  Content type: Blog

TT-DAC-PS: Twin-Target Deterministic Actor-Critic with Policy Smoothing for Optimal Trade Execution

 🤖AI  Content type: Academic
arxiv.org·
