Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Reinforcement Learning
🎮 Reinforcement Learning
AI Agents, Reward Systems, Game Theory, Q-Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
62
posts in
12.8
ms
Reinforcement
Learning
and Optimal Control Book (RIP Dimitri Bertsekas)
🧮
Algorithms
Content type:
Academic
web.mit.edu
·
5d
5 days ago
·
Hacker News
Actions for Reinforcement Learning and Optimal Control Book (RIP Dimitri Bertsekas)
Agents
Need Work Data: A Primer on RLWD, or
Reinforcement
Learning
on Work Data
🔌
Model Context Protocol
anjalishriva.com
·
1d
1 day ago
·
Hacker News
Actions for Agents Need Work Data: A Primer on RLWD, or Reinforcement Learning on Work Data
SFT Offline
RL
Online
RL
: The Three-Stage Training Pipeline Behind Mano-P
🤖
AI
Content type:
Blog
dev.to
·
23h
23 hours ago
·
DEV
Actions for SFT Offline RL Online RL: The Three-Stage Training Pipeline Behind Mano-P
Model predictive task sampling for efficient and robust adaptation
📊
Approximate Computing
Content type:
Academic
nature.com
·
2d
2 days ago
Actions for Model predictive task sampling for efficient and robust adaptation
The Fundamental Choice in
Reinforcement
Learning
: On‑
Policy
vs. Off‑
Policy
🎲
Game Theory
towardsdatascience.com
·
5d
5 days ago
Actions for The Fundamental Choice in Reinforcement Learning: On‑Policy vs. Off‑Policy
AI-powered
living business intelligence network
📇
Indexing Strategies
atlasforgex.com
·
16h
16 hours ago
·
Hacker News
Actions for AI-powered living business intelligence network
SimarcLabs/pybullet-swarm-sim: Python framework for simulating drone swarms with PyBullet in seconds.
🧬
Optimization Algorithms
Content type:
Code
github.com
·
3d
3 days ago
·
r/opensource
Actions for SimarcLabs/pybullet-swarm-sim: Python framework for simulating drone swarms with PyBullet in seconds.
Researchers trained an open source
AI
search
agent
, Harness-1, that outperforms GPT-5.4 on recalling relevant information
🤖
AI
venturebeat.com
·
2d
2 days ago
·
Hacker News
Actions for Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information
How to Train Your Goblin
🤖
AI
goblins.mchen.workers.dev
·
3d
3 days ago
·
Hacker News
,
Hacker News
Actions for How to Train Your Goblin
How to Become an AWS
AI
Architect,The Honest Roadmap, the Projects, and Landing the Job
☁️
AWS Infrastructure
hackernoon.com
·
1d
1 day ago
Actions for How to Become an AWS AI Architect,The Honest Roadmap, the Projects, and Landing the Job
Beyond Dexterity: Why Contact May Define the Next Era of Robotics
🚀
Spacecraft Navigation
Content type:
Video
Content type:
News
spectrum.ieee.org
·
1d
1 day ago
·
Hacker News
Actions for Beyond Dexterity: Why Contact May Define the Next Era of Robotics
Human-Aligned
Decision
Transformers for satellite anomaly response operations with inverse simulation verification
🤖
AI
Content type:
Blog
dev.to
·
5d
5 days ago
·
DEV
Actions for Human-Aligned Decision Transformers for satellite anomaly response operations with inverse simulation verification
Why LLMs (still) lack taste
🤖
AI
beyondtheprior.com
·
2d
2 days ago
·
Hacker News
Actions for Why LLMs (still) lack taste
I got so mad at poke(rogue)like that I trained a
RL
agent
to beat it for me
🤖
AI
thiagolira.blot.im
·
3d
3 days ago
·
Hacker News
Actions for I got so mad at poke(rogue)like that I trained a RL agent to beat it for me
Apple rebuilt Siri on Google’s
AI
and Nvidia’s chips, then spent WWDC explaining why that doesn’t break its privacy promise
🤖
Copilot
Content type:
News
thenextweb.com
·
1d
1 day ago
Actions for Apple rebuilt Siri on Google’s AI and Nvidia’s chips, then spent WWDC explaining why that doesn’t break its privacy promise
Is an Online Master’s Degree in
AI
a Good Idea?
🤖
AI
towardsdatascience.com
·
6d
6 days ago
Actions for Is an Online Master’s Degree in AI a Good Idea?
Apple's New
AI
Models Contain 'None' of Google's Gemini Assistant
📓
Jupyter
Content type:
News
macrumors.com
·
1d
1 day ago
·
Hacker News
Actions for Apple's New AI Models Contain 'None' of Google's Gemini Assistant
See,
Act
, Correct: three levers for working with a code
agent
🤖
AI
Content type:
Blog
blog.owulveryck.info
·
6d
6 days ago
·
Hacker News
,
Hacker News
Actions for See, Act, Correct: three levers for working with a code agent
Memoirs of a
Learning
Machine: Autobiographical Self-Training and the Self-Training Gap
🤖
Copilot
zenodo.org
·
4d
4 days ago
·
Hacker News
Actions for Memoirs of a Learning Machine: Autobiographical Self-Training and the Self-Training Gap
Reinforcement
learning
in linear embedding space unlocks generalizable control across soft robot configurations
📐
Vector Embeddings
Content type:
Academic
nature.com
·
3d
3 days ago
Actions for Reinforcement learning in linear embedding space unlocks generalizable control across soft robot configurations
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help