Entropy-Guided Reasoning Compression
arxiv.org·4d

Abstract: Large reasoning models have demonstrated remarkable performance on complex reasoning tasks, yet the excessive length of their chain-of-thought outputs remains a major practical bottleneck due to high computation cost and poor deployability. Existing compression methods have achieved partial success but overlook a crucial phenomenon in the training process – the entropy conflict. During compression training, entropy decreases, leading to shorter reasoning but limited exploration, while accuracy-oriented objectives increase entropy, lengthening reasoning chains. This can cause the model to get stuck in a local dilemma. Our a…
