Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
reinforcement learning
馃 reinforcement learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
31
posts in
4.0
ms
gaelazzo/python_chess: Chess trainer
聽
馃搳
linear programming
聽
Content type:
Code
github.com
路
1d
1 day ago
路
Hacker News
Actions for gaelazzo/python_chess: Chess trainer
LLM Research Papers: The 2026 List (January to May)
聽
馃З
operations research
聽
Content type:
News
magazine.sebastianraschka.com
路
4d
4 days ago
路
Hacker News
Actions for LLM Research Papers: The 2026 List (January to May)
AI
model
predicts building fire spread, redirecting evacuees to safer exits in real time
聽
馃З
operations research
techxplore.com
路
6d
6 days ago
路
Hacker News
Actions for AI model predicts building fire spread, redirecting evacuees to safer exits in real time
Why Robotics Is a Pre-Paradigm Field
聽
馃З
operations research
聽
Content type:
News
whattotelltherobot.com
路
4d
4 days ago
路
Hacker News
Actions for Why Robotics Is a Pre-Paradigm Field
A wild idea: Abstract reality using ontology
聽
馃З
operations research
聽
Content type:
Discussion
news.ycombinator.com
路
4d
4 days ago
路
Hacker News
Actions for A wild idea: Abstract reality using ontology
Issue 654
聽
馃З
operations research
聽
Content type:
Blog
datascienceweekly.substack.com
路
6d
6 days ago
路
Substack
Actions for Issue 654
Best explanations of how LLMs work
聽
馃搳
linear programming
聽
Content type:
Blog
vorushin.github.io
路
4d
4 days ago
路
Hacker News
Actions for Best explanations of how LLMs work
KJLdefeated/RL.cu
: RLVR training for LLM in CUDA/C++
聽
馃
Rust
聽
Content type:
Code
github.com
路
3d
3 days ago
路
Hacker News
Actions for KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++
Show HN: The Deterministic Core Architecture for AI-Augmented Applications
聽
馃搳
linear programming
brandonbellsystems.com
路
5d
5 days ago
路
Hacker News
Actions for Show HN: The Deterministic Core Architecture for AI-Augmented Applications
Less-relevant results
Introducing the Third Generation of Apple鈥檚 Foundation
Models
聽
馃
Rust
machinelearning.apple.com
路
3d
3 days ago
路
Hacker News
,
r/apple
Actions for Introducing the Third Generation of Apple鈥檚 Foundation Models
I got so mad at poke(rogue)like that I trained a
RL
agent
to beat it for me
聽
馃搳
linear programming
聽
Content type:
Blog
blog.thiagolira.com.br
路
6d
6 days ago
路
Hacker News
Actions for I got so mad at poke(rogue)like that I trained a RL agent to beat it for me
No more posts from ddboline's subscribed feeds.
Scour all
25257
feeds
Learn more about Feeds
« Page 1
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help