Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Reinforcement Learning
🎮 Reinforcement Learning
Deep RL, Policy Gradients, Q-Learning, Multi-Agent Systems
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
143
posts in
5.6
ms
Import AI 460: Reward hacking society, RSI data from Anthropic; and
RL-based
quadcopter racing
✈
AFSIM and Air Combat
Content type:
News
Content type:
Blog
importai.substack.com
·
2d
2 days ago
·
Substack
Actions for Import AI 460: Reward hacking society, RSI data from Anthropic; and RL-based quadcopter racing
Fenn Tower Through Time: The Story of CSU’s Enduring Landmark
✈️
Aviation
Content type:
Academic
csuohio.edu
·
11h
11 hours ago
Actions for Fenn Tower Through Time: The Story of CSU’s Enduring Landmark
23 Years Ago, This Hit Comedy Hit Theaters as a Secret ‘Fight Club’ Parody, and Nobody Noticed
🤨
AI Skepticism
Content type:
News
vice.com
·
9h
9 hours ago
Actions for 23 Years Ago, This Hit Comedy Hit Theaters as a Secret ‘Fight Club’ Parody, and Nobody Noticed
cakewalk wyrm
⚡
Modern C++
thevalleybelow.id
·
2d
2 days ago
Actions for cakewalk wyrm
Geometrically Averaged Hard Target Updates for Linear
Q-Learning
🤖
Game AI
Content type:
Academic
arxiv.org
·
19h
19 hours ago
Actions for Geometrically Averaged Hard Target Updates for Linear Q-Learning
You'
re
doing it wrong
🤨
AI Skepticism
Content type:
News
understandably.com
·
1d
1 day ago
Actions for You're doing it wrong
Central College News
✈️
Aviation
Content type:
Academic
news.central.edu
·
3d
3 days ago
Actions for Central College News
Less-relevant results
The Appointment Beneath the Appointment
🤨
AI Skepticism
Content type:
Blog
firstchurchofthesingularity.com
·
13h
13 hours ago
Actions for The Appointment Beneath the Appointment
Combermere and Harrison College reach Under-15 basketball final
✈️
Aviation
cbc.bb
·
4d
4 days ago
Actions for Combermere and Harrison College reach Under-15 basketball final
Failure Modes of
Deep
Multi-Agent
RL in Asynchronous Pricing: Reproducible Triggers, Trace Diagnostics, and a Partial Fix
🤖
AI and Tactical Agents
Content type:
Academic
arxiv.org
·
19h
19 hours ago
Actions for Failure Modes of Deep Multi-Agent RL in Asynchronous Pricing: Reproducible Triggers, Trace Diagnostics, and a Partial Fix
Breaking free of a single datacenter: Practical geo-distributed AI operations with the k0smos platforms
🎨
GPU Computing
Content type:
Blog
cncf.io
·
2d
2 days ago
Actions for Breaking free of a single datacenter: Practical geo-distributed AI operations with the k0smos platforms
Heuristic
multi-site
optimization
for protein sequence design using Masked Protein Language Models
🐧
Computing Systems
journals.plos.org
·
5d
5 days ago
Actions for Heuristic multi-site optimization for protein sequence design using Masked Protein Language Models
OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training
agents
.
🤖
AI and Tactical Agents
Content type:
Blog
huggingface.co
·
2d
2 days ago
·
Hacker News
,
r/LocalLLaMA
Actions for OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.
Hey-Meadow/meadow-mind: Zero training, second-level reactions (~400ms). A language-rule decision mind on a local 7B diffusion LM.
🤖
Game AI
Content type:
Code
github.com
·
5h
5 hours ago
·
Hacker News
Actions for Hey-Meadow/meadow-mind: Zero training, second-level reactions (~400ms). A language-rule decision mind on a local 7B diffusion LM.
U.S. Dental Insurance Market Growth, Coverage
Trends
and Industry Forecast
⚖️
AI Regulation
community.ops.io
·
2d
2 days ago
Actions for U.S. Dental Insurance Market Growth, Coverage Trends and Industry Forecast
Discovering Interpretable
Multi-Parameter
Control
Policies
for Evolutionary Algorithms Using
Deep
Reinforcement Learning
🤖
Game AI
Content type:
Academic
arxiv.org
·
19h
19 hours ago
Actions for Discovering Interpretable Multi-Parameter Control Policies for Evolutionary Algorithms Using Deep Reinforcement Learning
I got so mad at poke(rogue)like that I trained a
RL
agent
to beat it for me
🤖
Game AI
Content type:
Blog
blog.thiagolira.com.br
·
6d
6 days ago
·
Hacker News
Actions for I got so mad at poke(rogue)like that I trained a RL agent to beat it for me
Students discover long-lost Roman villa under high school
gym
🤖
Game AI
Content type:
News
popsci.com
·
2d
2 days ago
Actions for Students discover long-lost Roman villa under high school gym
Test Your Skills Against an AI Air Hockey Robot
✈
AFSIM and Air Combat
Content type:
News
hackster.io
·
6d
6 days ago
Actions for Test Your Skills Against an AI Air Hockey Robot
Flow-DPPO: Divergence
Proximal
Policy
Optimization
for Flow Matching Models
🤖
Game AI
Content type:
Academic
arxiv.org
·
19h
19 hours ago
Actions for Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help