Reconstruction of existing buildings for evacuation assessment under emergency situations using 3D Gaussian splatting and machine learning
sciencedirect.comยท1h
๐ฐRemote sensing
Flag this post
Why Multimodal AI Broke the Data Pipeline โ And How Daft Is Beating Ray and Spark to Fix It
hackernoon.comยท1d
๐ง OpenAI
Flag this post
Reality check
โTechnology
Flag this post
Show HN: Hot or Slop โ Visual Turing test on how well humans detect AI images
๐Grad-CAM
Flag this post
Spatial Sense: Unleashing Language Models on Location Data by Arvind Sundararajan
๐ง OpenAI
Flag this post
ZEBRA: Towards Zero-Shot Cross-Subject Generalization for Universal Brain Visual Decoding
arxiv.orgยท1d
๐Grad-CAM
Flag this post
pDANSE: Particle-based Data-driven Nonlinear State Estimation from Nonlinear Measurements
arxiv.orgยท1d
โ๏ธPoint Cloud Processing
Flag this post
When to Trust the Answer: Question-Aligned Semantic Nearest Neighbor Entropy for Safer Surgical VQA
arxiv.orgยท11h
๐ง OpenAI
Flag this post
Mutual Information guided Visual Contrastive Learning
arxiv.orgยท11h
๐Grad-CAM
Flag this post
Deployable Vision-driven UAV River Navigation via Human-in-the-loop Preference Alignment
arxiv.orgยท11h
๐บGeometric Learning
Flag this post
FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts
arxiv.orgยท11h
๐ง OpenAI
Flag this post
FLoC: Facility Location-Based Efficient Visual Token Compression for Long Video Understanding
arxiv.orgยท11h
๐Bokeh
Flag this post
A high-resolution large-scale dataset for building segmentation from aerial imagery in northeastern Italy
nature.comยท1d
๐ฐRemote sensing
Flag this post
UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning
๐๏ธVision Transformers
Flag this post
Building a Multimodal RAG That Responds with Text, Images, and Tables from Sources
towardsdatascience.comยท20h
๐ง OpenAI
Flag this post
ClipTagger-12B VLM: Frame Captioning Tutorial
๐ง OpenAI
Flag this post
Decoding human safety perception with eye-tracking systems, street view images, and explainable AI
sciencedirect.comยท2d
๐Grad-CAM
Flag this post
T3: Test-Time Model Merging in VLMs for Zero-Shot Medical Imaging Analysis
arxiv.orgยท1d
๐Grad-CAM
Flag this post
Loading...Loading more...