OSMGen: Highly Controllable Satellite Image Synthesis using OpenStreetMap Data
arxiv.org · 8h
OpenAI

Deep Neural Watermarking for Robust Copyright Protection in 3D Point Clouds
arxiv.org · 1d
Point Cloud Processing

Disciplined Biconvex Programming
arxiv.org · 8h
NumPy

NeuraSnip A Local Semantic Image Search Engine
OpenAI

Multi-Modal Feature Fusion for Spatial Morphology Analysis of Traditional Villages via Hierarchical Graph Neural Networks
arxiv.org · 1d
OpenAI

M^3Detection: Multi-Frame Multi-Level Feature Fusion for Multi-Modal 3D Object Detection with Camera and 4D Imaging Radar
arxiv.org · 1d
Computer vision

VLM6D: VLM based 6Dof Pose Estimation based on RGB-D Images
arxiv.org · 8h
Geometric Learning

Deep Generative Models for Enhanced Vitreous OCT Imaging
arxiv.org · 8h
Vision Transformers

A Hybrid Deep Learning and Forensic Approach for Robust Deepfake Detection
arxiv.org · 1d
Grad-CAM

flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R
arxiv.org · 8h
OpenAI

FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts
arxiv.org · 8h
OpenAI

Bridging Vision, Language, and Mathematics: Pictographic Character Reconstruction with Bézier Curves
arxiv.org · 8h
Geometric Learning

HyFormer-Net: A Synergistic CNN-Transformer with Interpretable Multi-Scale Fusion for Breast Lesion Segmentation and Classification in Ultrasound Images
arxiv.org · 8h
Vision Transformers

GeneFlow: Translation of Single-cell Gene Expression to Histopathological Images via Rectified Flow
arxiv.org · 8h
Altair

Investigating Label Bias and Representational Sources of Age-Related Disparities in Medical Segmentation
arxiv.org · 8h
Grad-CAM

Efficient Curvature-aware Graph Network
arxiv.org · 8h
Geometric Learning

FLoC: Facility Location-Based Efficient Visual Token Compression for Long Video Understanding
arxiv.org · 8h
Bokeh

Building a Multimodal RAG That Responds with Text, Images, and Tables from Sources
towardsdatascience.com · 17h
OpenAI