Perceptual Loss, Neural Distance, Embedding Similarity, Content-aware Metrics
CmFNet: Cross-modal Fusion Network for Weakly-supervised Segmentation of Medical Images
arxiv.orgยท1d
MemeMind: A Large-Scale Multimodal Dataset with Chain-of-Thought Reasoning for Harmful Meme Detection
arxiv.orgยท14h
General Methods Make Great Domain-specific Foundation Models: A Case-study on Fetal Ultrasound
arxiv.orgยท14h
Kernel spectral joint embeddings for high-dimensional noisy datasets using duo-landmark integral operators
arxiv.orgยท1d
Flatness After All?
arxiv.orgยท1d
Visual hallucination detection in large vision-language models via evidential conflict
arxiv.orgยท14h
BPCLIP: A Bottom-up Image Quality Assessment from Distortion to Semantics Based on CLIP
arxiv.orgยท1d
Loading...Loading more...