Vision-Language Model Guided Image Restoration
arxiv.org·5d
🎧Audio Restoration
Preview
Report Post

Title:Vision-Language Model Guided Image Restoration

View PDF HTML (experimental)

Abstract:Many image restoration (IR) tasks require both pixel-level fidelity and high-level semantic understanding to recover realistic photos with fine-grained details. However, previous approaches often struggle to effectively leverage both the visual and linguistic knowledge. Recent efforts have attempted to incorporate Vision-language models (VLMs), which excel at aligning visual and textual features, into universal IR. Nevertheless, these methods fail to utilize the linguistic priors to ensure semantic coherence during the restoration process. To address this issue, in this paper, we propose the Vision-Language Mo…

Similar Posts

Loading similar posts...