MIND: Multi-rationale INtegrated Discriminative Reasoning Framework for Multi-modal Large Models
arxiv.org·8h


Abstract: Recently, multimodal large language models (MLLMs) have been widely applied to reasoning tasks. However, they suffer from limited multi-rationale semantic modeling and insufficient logical robustness, and they are susceptible to misleading interpretations in complex scenarios. Therefore, we propose a Multi-rationale INtegrated Discriminative (MIND) reasoning framework, which is designed to endow MLLMs with the human-like cognitive abilities of "Understand -> Rethink -> Correct" and achieves a paradigm evolution from passive imitation-based reasoning to active discriminative reasoning. Specifically, we introduce a Rationale Augmentation and Discrimination (RAD) paradigm, w…
