🎮 Reinforcement Learning
Q-Learning, Policy Gradient, RL Agents, Game AI
Scoured 257 posts in 12.0 ms
🤖 AI · news.mit.edu · 5 days ago
In game theory, generalists sometimes win out over specialists
🤖 AI · Kilo Blog · 10 hours ago
Announcing Next-Edit in Kilo, Powered by Inception
🤖 AI · arXiv · 5 days ago
Pareto Q-Learning with Reward Machines
🤖 AI · The Batch · 3 days ago
Jun 19, 2026
🤖 Machine Learning · introml.mit.edu · 1 day ago
Introduction to Machine Learning
🤖 Machine Learning · ollama.com · 5 days ago
north-mini-code-1.0
Covers 2 stories, including: Enterprise AI: Private, Secure, Customizable
🤖 AI · Databricks · 5 days ago
Cloned
Covers: NVIDIA Triton Inference Server
Covered by: lebigdata.fr
🔥 PyTorch · runtimewire.com · 6 days ago
Cursor Says 1.5T Parameter Coding Model Is Training on 100k GPUs
Covers 3 stories, including: Do you respect 'Vibe Coders'? Can you actually call them devs?
Discussed on: Hacker News
Less-relevant results
🤖 AI · The New York Times · Content type: Video · 15 hours ago
For Suns' Devin Booker, new number opens new chapter in search of better ending
🤖 AI · LessWrong · 1 day ago
How persona training could fail
🤖 AI · Technically · 4 days ago
What are code sandboxes?
🤖 AI · ScienceDirect · 4 days ago
Global Structure-Aware R-Tree: a spatial indexing mechanism using Deep Reinforcement Learning and Self-Play
🤖 Machine Learning · The Decoder · 3 days ago
Google Deepmind loses another top AI researcher as Nobel laureate John Jumper leaves for Anthropic
Covered by: 何夕2077的个人站, habr.com
🤖 AI · Forbes · 2 days ago
Solution To The Curious Mystery Of Why AI Keeps Inventing The Same Fake Names Over And Over Again
🔥 PyTorch · ScienceDirect · 3 days ago
Digital twin-driven deep reinforcement learning for coordinated scheduling and state prediction of distributed energy storage clusters
🔬 Science · medium.com · 3 days ago
A Human-Augmenting Agentic Workflow for Causal Inference
🔬 Science · Phys.org · 2 days ago
NASA testing advanced capabilities for moon, Mars rovers
Covered by: kite.kagi.com
🔬 Science · PsyPost · 18 hours ago
Neuroscientists uncover how serotonin alters “belief stickiness”
🤖 AI · shanethegamer.com · 6 days ago
They made a Pokemon TCG AI Battle Challenge with a $290k prize pool
Discussed on: Hacker News
🤖 AI · alisawuffles.github.io · 22 hours ago
Notes on the Industry Job Search
Covers: How To Scale Your Model
Discussed on: Hacker News and Hacker News