CogDoc: Towards Unified thinking in Documents
arxiv.org·9h
🤖Advanced OCR
Preview
Report Post

Title:CogDoc: Towards Unified thinking in Documents

View PDF HTML (experimental)

Abstract:Current document reasoning paradigms are constrained by a fundamental trade-off between scalability (processing long-context documents) and fidelity (capturing fine-grained, multimodal details). To bridge this gap, we propose CogDoc, a unified coarse-to-fine thinking framework that mimics human cognitive processes: a low-resolution "Fast Reading" phase for scalable information localization,followed by a high-resolution "Focused Thinking" phase for deep reasoning. We conduct a rigorous investigation into post-training strategies for the unified thinking framework, demonstrating that a Direct Reinforcement Lear…

Similar Posts

Loading similar posts...