Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses (opens in new tab)

Covered by 6 sources including imjuya.github.io, venturebeat.com

Search agents are often trained as policies over growing transcripts: the model must decide how to search while also remembering what it has seen, which evidence is useful, which constraints remain open, and which claims have actually been checked. We argue that this formulation puts too much routine state management inside the policy: reinforcement learning is forced to optimize both semantic search decisions and recoverable bookkeeping that ...

Read the original article

Sign in to keep reading the full article.

Sign Up Log In

Covered in 6 articles

venturebeat.com·

Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information

Discussed on Hacker News

AI Newsletter·

🥇Top AI Papers of the Week

Gradient Ascent·

Spotify's Agent Context Layer, DeepMind's Nine Erdős Proofs, and GitHub's Spec-Kit - The Tokenizer Edition #31

View all 6 ›