reinforcement learning

Feeds to Scour
SubscribedAll
Scoured 31 posts in 4.0 ms

gaelazzo/python_chess: Chess trainer

馃搳linear programmingContent type: Code
github.comHacker News

LLM Research Papers: The 2026 List (January to May)

馃Зoperations researchContent type: News

AI model predicts building fire spread, redirecting evacuees to safer exits in real time

馃Зoperations research
techxplore.comHacker News

Why Robotics Is a Pre-Paradigm Field

馃Зoperations researchContent type: News

A wild idea: Abstract reality using ontology

馃Зoperations researchContent type: Discussion

Issue 654

馃Зoperations researchContent type: Blog

Best explanations of how LLMs work

馃搳linear programmingContent type: Blog

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

馃RustContent type: Code
github.comHacker News

Show HN: The Deterministic Core Architecture for AI-Augmented Applications

馃搳linear programming
Less-relevant results

Introducing the Third Generation of Apple鈥檚 Foundation Models

馃Rust

I got so mad at poke(rogue)like that I trained a RL agent to beat it for me

馃搳linear programmingContent type: Blog

No more posts from ddboline's subscribed feeds.

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help