LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning
arxiv.org·7h
🔲Cellular Automata
Preview
Report Post

View PDF HTML (experimental)

Abstract:Large reasoning models achieve strong performance on complex tasks by generating extended chains of thought, but they often "overthink": continuing to reason long after they have enough information to answer correctly. This wastes inference-time compute and can hurt accuracy. Existing attempts to stop early either manipulate decoding with extra sampling and heuristics, rely on auxiliary verifier models, or operate only as post-hoc analysis pipelines without formal guarantees. We introduce LYNX, an online early-exit mechanism that turns a model’s own hidden-state awareness into confidence-controlled stopping decisions. LYNX attaches exit decisions to naturally occurrin…

Similar Posts

Loading similar posts...