Handwritten Text Recognition for Low Resource Languages
arxiv.org·5d

Abstract: Despite considerable progress in handwritten text recognition, paragraph-level recognition remains challenging, especially for low-resource languages such as Hindi and Urdu and for languages written in similar scripts. These languages often lack comprehensive linguistic resources and need special attention if robust, accurate optical character recognition (OCR) systems are to be developed. This paper introduces BharatOCR, a novel segmentation-free approach to paragraph-level handwritten Hindi and Urdu text recognition. We propose a ViT-Transformer Decoder-LM architecture for handwritten text recognition, where a Vision T…
