Lossy Compression Bounds, Information Bottleneck, Perceptual Coding, Quality Metrics
TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models
arxiv.org·2h
Propose and Rectify: A Forensics-Driven MLLM Framework for Image Manipulation Localization
arxiv.org·2h
Benchmarking Class Activation Map Methods for Explainable Brain Hemorrhage Classification on Hemorica Dataset
arxiv.org·2h
On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization
arxiv.org·2h