Reinforcement Learning

Feeds to Scour
SubscribedAll
Scoured 143 posts in 5.6 ms

Import AI 460: Reward hacking society, RSI data from Anthropic; and RL-based quadcopter racing

 AFSIM and Air Combat  Content type: News  Content type: Blog

Fenn Tower Through Time: The Story of CSU’s Enduring Landmark

 ✈️Aviation  Content type: Academic
csuohio.edu·

23 Years Ago, This Hit Comedy Hit Theaters as a Secret ‘Fight Club’ Parody, and Nobody Noticed

 🤨AI Skepticism  Content type: News
vice.com·

cakewalk wyrm

 Modern C++
thevalleybelow.id·

Geometrically Averaged Hard Target Updates for Linear Q-Learning

 🤖Game AI  Content type: Academic
arxiv.org·

You're doing it wrong

 🤨AI Skepticism  Content type: News
understandably.com·

Central College News

 ✈️Aviation  Content type: Academic
news.central.edu·
Less-relevant results

The Appointment Beneath the Appointment

 🤨AI Skepticism  Content type: Blog

Combermere and Harrison College reach Under-15 basketball final

 ✈️Aviation
cbc.bb·

Failure Modes of Deep Multi-Agent RL in Asynchronous Pricing: Reproducible Triggers, Trace Diagnostics, and a Partial Fix

 🤖AI and Tactical Agents  Content type: Academic
arxiv.org·

Breaking free of a single datacenter: Practical geo-distributed AI operations with the k0smos platforms

 🎨GPU Computing  Content type: Blog
cncf.io·

Heuristic multi-site optimization for protein sequence design using Masked Protein Language Models

 🐧Computing Systems
journals.plos.org·

OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.

 🤖AI and Tactical Agents  Content type: Blog

Hey-Meadow/meadow-mind: Zero training, second-level reactions (~400ms). A language-rule decision mind on a local 7B diffusion LM.

 🤖Game AI  Content type: Code
github.com··Hacker News

U.S. Dental Insurance Market Growth, Coverage Trends and Industry Forecast

 ⚖️AI Regulation
community.ops.io·

Discovering Interpretable Multi-Parameter Control Policies for Evolutionary Algorithms Using Deep Reinforcement Learning

 🤖Game AI  Content type: Academic
arxiv.org·

I got so mad at poke(rogue)like that I trained a RL agent to beat it for me

 🤖Game AI  Content type: Blog

Students discover long-lost Roman villa under high school gym

 🤖Game AI  Content type: News
popsci.com·

Test Your Skills Against an AI Air Hockey Robot

 AFSIM and Air Combat  Content type: News
hackster.io·

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

 🤖Game AI  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help