FoCLIP: A Feature-Space Misalignment Framework for CLIP-Based Image Manipulation and Detection
arxiv.org·3h
🧠AI
Flag this post
A benchmark multimodal oro-dental dataset for large vision-language models
arxiv.org·1d
🧠AI
Flag this post
A Low-Rank Method for Vision Language Model Hallucination Mitigation in Autonomous Driving
arxiv.org·3h
🧠AI
Flag this post
Classification of Microplastic Particles in Water using Polarized Light Scattering and Machine Learning Methods
arxiv.org·3h
🧠AI
Flag this post
Understanding Convolutional Neural Networks
pub.towardsai.net·3d
🧠AI
Flag this post
Deep Learning for Molecules and Materials
🧠AI
Flag this post
Automatic Extraction of Road Networks by using Teacher-Student Adaptive Structural Deep Belief Network and Its Application to Landslide Disaster
arxiv.org·3h
🧠AI
Flag this post
A Second-Order Attention Mechanism For Prostate Cancer Segmentation and Detection in Bi-Parametric MRI
arxiv.org·3h
🧠AI
Flag this post
Automated Invoice Data Extraction: Using LLM and OCR
arxiv.org·3h
🧠AI
Flag this post
Convolutional Fully-Connected Capsule Network (CFC-CapsNet): A Novel and Fast Capsule Network
arxiv.org·3h
🧠AI
Flag this post
Understanding Cross Task Generalization in Handwriting-Based Alzheimer's Screening via Vision Language Adaptation
arxiv.org·3h
🧠AI
Flag this post
Real-world chemistry lab image dataset for equipment recognition across 25 apparatus categories
nature.com·5d
🧠AI
Flag this post
This Camera System Can Focus on Everything, Everywhere, All At Once
petapixel.com·11h
🧠AI
Flag this post
Loading...Loading more...