Next-gen Image Format, Lossless Transition, Progressive Decoding, Adaptive Quantization
CEIDM: A Controlled Entity and Interaction Diffusion Model for Enhanced Text-to-Image Generation
arxiv.org·1h
MSPCaps: A Multi-Scale Patchify Capsule Network with Cross-Agreement Routing for Visual Recognition
arxiv.org·1h
First Place Solution to the MLCAS 2025 GWFSS Challenge: The Devil is in the Detail and Minority
arxiv.org·1h
An Efficient Dual-Line Decoder Network with Multi-Scale Convolutional Attention for Multi-organ Segmentation
arxiv.org·1h
TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models
arxiv.org·1h
A comparative study of some wavelet and sampling operators on various features of an image
arxiv.org·5d