Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Reinforcement Learning
🎮 Reinforcement Learning
RL, AI Agents, Game Playing, Policy Optimization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
41
posts in
8.6
ms
Reinforcement
Learning
and
Optimal
Control Book (RIP Dimitri Bertsekas)
✍️
Prompt Engineering
Content type:
Academic
web.mit.edu
·
5d
5 days ago
·
Hacker News
Actions for Reinforcement Learning and Optimal Control Book (RIP Dimitri Bertsekas)
Agents
Need Work Data: A Primer on RLWD, or
Reinforcement
Learning
on Work Data
🧠
Claude
anjalishriva.com
·
1d
1 day ago
·
Hacker News
Actions for Agents Need Work Data: A Primer on RLWD, or Reinforcement Learning on Work Data
Some Interesting Papers on RLVR
✍️
Prompt Engineering
lesswrong.com
·
1d
1 day ago
Actions for Some Interesting Papers on RLVR
AI-powered
living business intelligence network
🗃️
Database Optimization
atlasforgex.com
·
12h
12 hours ago
·
Hacker News
Actions for AI-powered living business intelligence network
Memoirs of a
Learning
Machine: Autobiographical Self-Training and the Self-Training Gap
✍️
Prompt Engineering
zenodo.org
·
4d
4 days ago
·
Hacker News
Actions for Memoirs of a Learning Machine: Autobiographical Self-Training and the Self-Training Gap
Propel: Breaking the Solver Bottleneck in Task-Generator
RL
✍️
Prompt Engineering
vmax.ai
·
4h
4 hours ago
·
Hacker News
Actions for Propel: Breaking the Solver Bottleneck in Task-Generator RL
A wild idea: Abstract reality using ontology
✍️
Prompt Engineering
Content type:
Discussion
news.ycombinator.com
·
4d
4 days ago
·
Hacker News
Actions for A wild idea: Abstract reality using ontology
Researchers trained an open source
AI
search
agent
, Harness-1, that outperforms GPT-5.4 on recalling relevant information
🧠
Claude
venturebeat.com
·
2d
2 days ago
·
Hacker News
Actions for Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information
You are here on the
AI
change curve
✍️
Prompt Engineering
howfastis.ai
·
2d
2 days ago
·
Hacker News
,
Hacker News
Actions for You are here on the AI change curve
Arithmetic Pedagogy for Language Models
✍️
Prompt Engineering
Content type:
Academic
arxiv.org
·
6d
6 days ago
·
Hacker News
Actions for Arithmetic Pedagogy for Language Models
Beyond Dexterity: Why Contact May Define the Next Era of Robotics
✍️
Prompt Engineering
Content type:
Video
Content type:
News
spectrum.ieee.org
·
1d
1 day ago
·
Hacker News
Actions for Beyond Dexterity: Why Contact May Define the Next Era of Robotics
Why LLMs (still) lack taste
🚢
DevOps Automation
beyondtheprior.com
·
2d
2 days ago
·
Hacker News
Actions for Why LLMs (still) lack taste
I got so mad at poke(rogue)like that I trained a
RL
agent
to beat it for me
🧠
Machine Learning
Content type:
Blog
blog.thiagolira.com.br
·
6d
6 days ago
·
Hacker News
Actions for I got so mad at poke(rogue)like that I trained a RL agent to beat it for me
Nvidia Nemotron 3 Ultra
⚙️
AI Infrastructure
research.nvidia.com
·
6d
6 days ago
·
Hacker News
Actions for Nvidia Nemotron 3 Ultra
Vibe Diaries: Training Nanochat
🧠
Machine Learning
vibediary.dev
·
2d
2 days ago
·
Hacker News
Actions for Vibe Diaries: Training Nanochat
Agentic
RL
: Token-In, Token-Out Done Right
✍️
Prompt Engineering
qgallouedec-tito.hf.space
·
1d
1 day ago
·
Hacker News
Actions for Agentic RL: Token-In, Token-Out Done Right
See,
Act
, Correct: three levers for working with a code
agent
🧠
Claude
Content type:
Blog
blog.owulveryck.info
·
6d
6 days ago
·
Hacker News
,
Hacker News
Actions for See, Act, Correct: three levers for working with a code agent
Apple's New
AI
Models Contain 'None' of Google's Gemini Assistant
🤖
AI
Content type:
News
macrumors.com
·
1d
1 day ago
·
Hacker News
Actions for Apple's New AI Models Contain 'None' of Google's Gemini Assistant
LLM Research Papers: The 2026 List (January to May)
🧠
AI Research
Content type:
News
magazine.sebastianraschka.com
·
4d
4 days ago
·
Hacker News
Actions for LLM Research Papers: The 2026 List (January to May)
gaelazzo/python_chess: Chess trainer
🦀
Rust Systems
Content type:
Code
github.com
·
1d
1 day ago
·
Hacker News
Actions for gaelazzo/python_chess: Chess trainer
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help