Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Reinforcement Learning
馃幃 Reinforcement Learning
RL, AI Agents, Game Playing, Policy Optimization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
41
posts in
8.0
ms
Reinforcement
Learning
and
Optimal
Control Book (RIP Dimitri Bertsekas)
聽
馃
AI Research
聽
Content type:
Academic
web.mit.edu
路
5d
5 days ago
路
Hacker News
Actions for Reinforcement Learning and Optimal Control Book (RIP Dimitri Bertsekas)
Agents
Need Work Data: A Primer on RLWD, or
Reinforcement
Learning
on Work Data
聽
馃
AI Research
anjalishriva.com
路
1d
1 day ago
路
Hacker News
Actions for Agents Need Work Data: A Primer on RLWD, or Reinforcement Learning on Work Data
Some Interesting Papers on RLVR
聽
馃
AI Research
lesswrong.com
路
1d
1 day ago
Actions for Some Interesting Papers on RLVR
AI-powered
living business intelligence network
聽
馃
AI Research
atlasforgex.com
路
15h
15 hours ago
路
Hacker News
Actions for AI-powered living business intelligence network
Memoirs of a
Learning
Machine: Autobiographical Self-Training and the Self-Training Gap
聽
馃
AI Research
zenodo.org
路
4d
4 days ago
路
Hacker News
Actions for Memoirs of a Learning Machine: Autobiographical Self-Training and the Self-Training Gap
Propel: Breaking the Solver Bottleneck in Task-Generator
RL
聽
馃
AI Research
vmax.ai
路
7h
7 hours ago
路
Hacker News
Actions for Propel: Breaking the Solver Bottleneck in Task-Generator RL
A wild idea: Abstract reality using ontology
聽
鉁嶏笍
Prompt Engineering
聽
Content type:
Discussion
news.ycombinator.com
路
4d
4 days ago
路
Hacker News
Actions for A wild idea: Abstract reality using ontology
Researchers trained an open source
AI
search
agent
, Harness-1, that outperforms GPT-5.4 on recalling relevant information
聽
馃
Claude
venturebeat.com
路
2d
2 days ago
路
Hacker News
Actions for Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information
You are here on the
AI
change curve
聽
鉁嶏笍
Prompt Engineering
howfastis.ai
路
2d
2 days ago
路
Hacker News
,
Hacker News
Actions for You are here on the AI change curve
Arithmetic Pedagogy for Language Models
聽
馃
AI Research
聽
Content type:
Academic
arxiv.org
路
6d
6 days ago
路
Hacker News
Actions for Arithmetic Pedagogy for Language Models
Beyond Dexterity: Why Contact May Define the Next Era of Robotics
聽
馃
AI Research
聽
Content type:
Video
聽
Content type:
News
spectrum.ieee.org
路
1d
1 day ago
路
Hacker News
Actions for Beyond Dexterity: Why Contact May Define the Next Era of Robotics
Why LLMs (still) lack taste
聽
馃
AI Research
beyondtheprior.com
路
2d
2 days ago
路
Hacker News
Actions for Why LLMs (still) lack taste
I got so mad at poke(rogue)like that I trained a
RL
agent
to beat it for me
聽
馃
AI Research
聽
Content type:
Blog
blog.thiagolira.com.br
路
6d
6 days ago
路
Hacker News
Actions for I got so mad at poke(rogue)like that I trained a RL agent to beat it for me
Nvidia Nemotron 3 Ultra
聽
馃
AI Research
research.nvidia.com
路
6d
6 days ago
路
Hacker News
Actions for Nvidia Nemotron 3 Ultra
Vibe Diaries: Training Nanochat
聽
馃
Machine Learning
vibediary.dev
路
2d
2 days ago
路
Hacker News
Actions for Vibe Diaries: Training Nanochat
Agentic
RL
: Token-In, Token-Out Done Right
聽
馃
AI Research
qgallouedec-tito.hf.space
路
1d
1 day ago
路
Hacker News
Actions for Agentic RL: Token-In, Token-Out Done Right
See,
Act
, Correct: three levers for working with a code
agent
聽
馃
Claude
聽
Content type:
Blog
blog.owulveryck.info
路
6d
6 days ago
路
Hacker News
,
Hacker News
Actions for See, Act, Correct: three levers for working with a code agent
Apple's New
AI
Models Contain 'None' of Google's Gemini Assistant
聽
馃
AI Research
聽
Content type:
News
macrumors.com
路
1d
1 day ago
路
Hacker News
Actions for Apple's New AI Models Contain 'None' of Google's Gemini Assistant
LLM Research Papers: The 2026 List (January to May)
聽
馃
AI Research
聽
Content type:
News
magazine.sebastianraschka.com
路
4d
4 days ago
路
Hacker News
Actions for LLM Research Papers: The 2026 List (January to May)
gaelazzo/python_chess: Chess trainer
聽
馃
AI Research
聽
Content type:
Code
github.com
路
1d
1 day ago
路
Hacker News
Actions for gaelazzo/python_chess: Chess trainer
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help