Transformer Models, Layout Analysis, OCR Enhancement, Information Extraction
UMATO: Bridging Local and Global Structures for Reliable Visual Analytics with Dimensionality Reduction
arxiv.org·1d
Hierarchical Contextual Grounding LVLM: Enhancing Fine-Grained Visual-Language Understanding with Robust Grounding
arxiv.org·14h
Segmentation and Classification of Pap Smear Images for Cervical Cancer Detection Using Deep Learning
arxiv.org·14h
Loading...Loading more...