LDP: Parameter-Efficient Fine-Tuning of Multimodal LLM for Medical Report Generation
arxiv.org·3d

Abstract: Colonoscopic polyp diagnosis is pivotal for early colorectal cancer detection, yet traditional automated reporting suffers from inconsistencies and hallucinations due to the scarcity of high-quality multimodal medical data. To bridge this gap, we propose LDP, a novel framework leveraging multimodal large language models (MLLMs) for professional polyp diagnosis report generation. Specifically, we curate MMEndo, a multimodal endoscopic dataset comprising expert-annotated colonoscopy image-text pairs. We fine-tune the Qwen2-VL-7B backbone using Low-Rank Adaptation (LoRA), a parameter-efficient fine-tuning method, and align it with clinical standards via Direct Preference Optimization (DPO). Extens…
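
For readers unfamiliar with the two techniques the abstract names, the following is a minimal sketch of the standard LoRA parameter count and the standard DPO loss for a single preference pair. It is not taken from the paper: the function names, the layer dimensions, the rank, and the `beta` value are illustrative assumptions, since the preview does not state the authors' hyperparameters.

```python
import math


def lora_trainable_params(d_in, d_out, rank):
    """Trainable parameters LoRA adds to one d_out x d_in weight matrix.

    LoRA freezes the original weight W and learns an update B @ A,
    with A of shape (rank, d_in) and B of shape (d_out, rank).
    """
    return rank * (d_in + d_out)


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) response pair.

    Inputs are summed log-probabilities of each response under the
    policy being trained and under the frozen reference model.
    beta=0.1 is an assumed default, not a value from the paper.
    """
    chosen_gain = policy_chosen_logp - ref_chosen_logp
    rejected_gain = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_gain - rejected_gain)
    # -log(sigmoid(margin)), computed stably as softplus(-margin).
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


if __name__ == "__main__":
    # Hypothetical 4096 x 4096 projection with rank 8.
    print(lora_trainable_params(4096, 4096, 8))  # 65536, versus 16777216 in the full matrix
    # Policy identical to the reference: the loss equals log(2).
    print(dpo_loss(-10.0, -12.0, -10.0, -12.0))
```

The loss falls below log(2) once the policy raises the log-probability of the preferred report relative to the reference more than it raises that of the rejected one, which is the signal DPO uses to align outputs with preference data.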
