Neural Recognition, Document AI, Layout Analysis, Multi-modal Processing
Automatic Pronunciation Error Detection and Correction of the Holy Quran's Learners Using Deep Learning
arxiv.org·1d
Lesion-Aware Visual-Language Fusion for Automated Image Captioning of Ulcerative Colitis Endoscopic Examinations
arxiv.org·2h
IS³: Generic Impulsive-Stationary Sound Separation in Acoustic Scenes using Deep Filtering
arxiv.org·2h
Gaussian process surrogate with physical law-corrected prior for multi-coupled PDEs defined on irregular geometry
arxiv.org·2h
Heatmap Guided Query Transformers for Robust Astrocyte Detection across Immunostains and Resolutions
arxiv.org·2h
S2M2ECG: Spatio-temporal bi-directional State Space Model Enabled Multi-branch Mamba for ECG
arxiv.org·2h
Multi-Scale Deep Learning for Colon Histopathology: A Hybrid Graph-Transformer Approach
arxiv.org·2h