reinforcement learning

Feeds to Scour
SubscribedAll
Scoured 31 posts in 7.1 ms

Reinforcement Learning and Optimal Control Book (RIP Dimitri Bertsekas)

馃Зoperations researchContent type: Academic
web.mit.eduHacker News

Agents Need Work Data: A Primer on RLWD, or Reinforcement Learning on Work Data

馃Зoperations research

Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.

馃搳linear programming
pub.towardsai.net

Propel: Breaking the Solver Bottleneck in Task-Generator RL

馃搳linear programming
vmax.aiHacker News

Why LLMs (still) lack taste

馃Зoperations research

Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information

馃Зoperations research
venturebeat.comHacker News

See, Act, Correct: three levers for working with a code agent

馃Зoperations researchContent type: Blog

Agentic RL: Token-In, Token-Out Done Right

馃搳linear programming

NVIDIA Nemotron 3 Ultra Powers Faster, More Efficient Reasoning for Long-Running Agents

馃Зoperations researchContent type: Blog

AI-powered living business intelligence network

馃Зoperations research
atlasforgex.com
Hacker News

I got so mad at poke(rogue)like that I trained a RL agent to beat it for me

馃搳linear programming

Beyond Dexterity: Why Contact May Define the Next Era of Robotics

馃Зoperations researchContent type: VideoContent type: News

Memoirs of a Learning Machine: Autobiographical Self-Training and the Self-Training Gap

馃Зoperations research
zenodo.orgHacker News

Stack Overflow didn't just help AI learn to code

馃Rust

Vibe Diaries: Training Nanochat

馃Rust
vibediary.devHacker News

The Effective Sample Size

馃Зoperations research
alex.smola.orgHacker News

Nvidia Nemotron 3 Ultra

馃Rust

Apple's New AI Models Contain 'None' of Google's Gemini Assistant

馃搳linear programmingContent type: News
macrumors.comHacker News

Arithmetic Pedagogy for Language Models

馃搳linear programmingContent type: Academic
arxiv.orgHacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help