Neural Recognition, Document AI, Layout Analysis, Multi-modal Processing
RefSTAR: Blind Facial Image Restoration with Reference Selection, Transfer, and Reconstruction
arxiv.org·2d
SurgeryLSTM: A Time-Aware Neural Model for Accurate and Explainable Length of Stay Prediction After Spine Surgery
arxiv.org·18h
Representation learning with a transformer by contrastive learning for money laundering detection
arxiv.org·2d
Graph Representations for Reading Comprehension Analysis using Large Language Model and Eye-Tracking Biomarker
arxiv.org·18h
ViTCoT: Video-Text Interleaved Chain-of-Thought for Boosting Video Understanding in Large Language Models
arxiv.org·2d
Loading...Loading more...