Geometry-Consistent Endoscopic Representations for Image-Guided Navigation via Structured Foundation Model Adaptation

Accurate vision-based navigation in monocular endoscopy is difficult due to limited depth cues, weak tissue texture, non-rigid deformation, and substantial appearance variation across domains, all of which complicate pose estimation, depth prediction, and image-to-anatomy alignment. Although recent vision foundation models have shown promise, their learned representations often remain insufficiently geometry-consistent, hindering stable feature ...
