🤖 Reinforcement Learning - nmarshall · Scour

See, Act, Correct: three levers for working with a code agent

🤖AI agents Blog

blog.owulveryck.info··Hacker News

Agents Need Work Data: A Primer on RLWD, or Reinforcement Learning on Work Data

anjalishriva.com··Hacker News

Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information

venturebeat.com··Hacker News

How to Train Your Goblin

✍️Prompt Engineering

goblins.mchen.workers.dev··Hacker News

OpenEnv is now owned by HF, Torch, Prime Intellect, Unsloth, Modal, Mercor, and more! Use it for training agents.

🤖AI agents Blog

huggingface.co··Hacker News, r/LocalLLaMA

Good teachers don’t cheat

📡Information Theory Blog

jasonkena.github.io··Hacker News

Agentic RL: Token-In, Token-Out Done Right

qgallouedec-tito.hf.space··Hacker News

NVIDIA Nemotron 3 Ultra Powers Faster, More Efficient Reasoning for Long-Running Agents

🤖AI agents Blog

developer.nvidia.com··Hacker News

Of Termites & Tokens

tomcritchlow.com··Hacker News

Reinforcement Learning and Optimal Control Book (RIP Dimitri Bertsekas)

📊Algorithms Academic

web.mit.edu··Hacker News

Why LLMs (still) lack taste

beyondtheprior.com··Hacker News

How to Stop Shipping Low-Quality RL Environments (with Examples)

🧬biology News

latent.space··Hacker News

Alpha-RTL: Test-Time Training for RTL Hardware Optimization

🔌FPGA Academic

Memoirs of a Learning Machine: Autobiographical Self-Training and the Self-Training Gap

zenodo.org··Hacker News

The Exploit Always Wins

🤖AI agents Blog

abhishek-shankar.com

I got so mad at poke(rogue)like that I trained a RL agent to beat it for me

thiagolira.blot.im··Hacker News

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

🔥PyTorch Code

github.com··Hacker News

Why Robotics Is a Pre-Paradigm Field

🤖Swarm Robotics News

whattotelltherobot.com··Hacker News

Nvidia Nemotron 3 Ultra

research.nvidia.com··Hacker News

I got so mad at poke(rogue)like that I trained a RL agent to beat it for me

📱Edge AI Blog

blog.thiagolira.com.br··Hacker News
