Semantic Segmentation

Feeds to Scour
SubscribedAll
Scoured 43 posts in 5.3 ms

Training-Free Generalized Few-Shot Segmentation through Open-Vocabulary Semantic Arbitration

 👁️Computer Vision  Content type: Academic
arxiv.org·

SegmentAnyTreeV2: Scaling Transformer-Based Tree Instance Segmentation Across Sensors, Platforms, and Forests

 👁️Computer Vision  Content type: Academic
arxiv.org·

Mind the Gap: Disentangling Performance Bottlenecks in Video Instance Segmentation

 👁️Computer Vision  Content type: Academic
arxiv.org·

Zero-Parameter Geometric Gating for Temporally Stable Low-Altitude UAV Video Semantic Segmentation

 👁️Computer Vision  Content type: Academic
arxiv.org·

Segment and Select: Vision-Language Segmentation in 3D Scenarios

 👁️Computer Vision  Content type: Academic
arxiv.org·

iSAGE: A Human-in-the-Loop Framework for Remote Sensing Semantic Segmentation via Sparse Point Supervision

 👁️Computer Vision  Content type: Academic
arxiv.org·

PairWise Image Finder: An Open-source Tool for Finding Visually Aligned Street-Level Image Pairs for Urban Perception Studies

 👁️Computer Vision  Content type: Academic
arxiv.org·

MedSIGHT: Towards Grounded Visual Comprehension in Medical Large Vision-Language Models

 🔀Multimodal AI  Content type: Academic
arxiv.org·

S23DR 2026 Winning Solution

 👁️Computer Vision  Content type: Academic
arxiv.org·

ZODS-RS -- Zero-training Oriented Detection & Segmentation for Remote Sensing

 👁️Computer Vision  Content type: Academic
arxiv.org·

Geometric Coastline Localization using Vision-Language Models

 🔀Multimodal AI  Content type: Academic
arxiv.org·

CheXanatomy: Anatomy-Aware Vision-Language Modeling for Chest Radiographs

 👁️Computer Vision  Content type: Academic
arxiv.org·
Less-relevant results

Advanced Flood Prediction with Physics-Guided Deep Learning: Combining UNet, FNO, and SAR/Optical Imagery

 👁️Computer Vision  Content type: Academic
arxiv.org·

AMN: An Adaptive Multi-Scale Fusion Network with Boundary and Uncertainty Modeling for Nuclei Segmentation

 👁️Computer Vision  Content type: Academic
arxiv.org·

Temporal Context Conditioning for Seasonality-Aware Precipitation Nowcasting of High-Intensity Rainfall

 🫧Gaussian Splatting  Content type: Academic
arxiv.org·

TrioPose: Native Triple-Stream Diffusion Transformers for Pose-Guided Text-to-Image Generation

 👁️Computer Vision  Content type: Academic
arxiv.org·

WHU-Infra3D: A Full-stack Multi-modal Dataset and Benchmark for 3D Roadside Infrastructure Inventory

 📡Point Clouds  Content type: Academic
arxiv.org·

PhysGraph: A Physics-aware 3D Scene Graph for Perception and Reasoning

 👁️Computer Vision  Content type: Academic
arxiv.org·

Globally Localizing Lunar Rover in Pixels via Graph Alignment

 👁️Computer Vision  Content type: Academic
arxiv.org·

Video-Rate Streaming Stylization on a Vision-Aware MLLM-Conditioned Edit Diffusion: Asymmetric Batched Inference on a Distilled UNet + MLLM Text Encoder

 👁️Computer Vision  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help