MultiMoE: Multimodal Mixture of Experts Adapter for Semantic Segmentation of Remote Sensing Images (opens in new tab)

Multimodal remote sensing image semantic segmentation has benefited remarkably from recent progress in deep learning. However, most existing methods treat heterogeneous modalities with excessive uniformity, failing to fully exploit their complementary features. Furthermore, the absence of visual prior knowledge limits their generalization capabilities across diverse scenarios. To address these challenges, we propose a novel multimodal fine-tuning framework based on vision foundation models, n...

Read the original article