Evaluating OCR performance on food packaging labels in South Africa
arxiv.org·4h
👁️OCR Verification
ScribeOCR – Web interface for recognizing text, OCR, & creating digitized docs
github.com·22h
📄Document Streaming
OEM technology – callas’s best kept secret
pdfa.org·13h
📄Document Digitization
Optimizing a QuickTake Image Decoder for the Apple II’s 6502
hackaday.com·1d
🍎Apple II Heritage
Lowercase leaving you cold? Introducing Retrocide
theregister.com·16h
🔠Terminal Fonts
Advances in Medical Image Segmentation: A Comprehensive Survey with a Focus on Lumbar Spine Applications
arxiv.org·4h
🌀Riemannian Computing
Automated Spectral Deconvolution & Peak Profiling for Bioprocess Monitoring
dev.to·8h
📄Document Digitization
Super-resolution image projection over an extended depth of field using a diffractive decoder
arxiv.org·4h
🌈Holographic Storage
The method of the approximate inverse for limited-angle CT
arxiv.org·4h
🏺Computational Archaeology
Pathology-CoT: Learning Visual Chain-of-Thought Agent from Expert Whole Slide Image Diagnosis Behavior
arxiv.org·4h
🤖Advanced OCR
Bidirectional Mammogram View Translation with Column-Aware and Implicit 3D Conditional Diffusion
arxiv.org·4h
⟷Bidirectional Programming
Detection of retinal diseases using an accelerated reused convolutional network
arxiv.org·4h
🌀Riemannian Computing
Multi-Modal Oral Cancer Detection Using Weighted Ensemble Convolutional Neural Networks
arxiv.org·4h
🧠Machine Learning
Detecting Notational Errors in Digital Music Scores
arxiv.org·1d
🎼Computational Musicology
DHQA-4D: Perceptual Quality Assessment of Dynamic 4D Digital Human
arxiv.org·4h
📊Rate-Distortion Theory
HAVIR: HierArchical Vision to Image Reconstruction using CLIP-Guided Versatile Diffusion
arxiv.org·1d
🤖Advanced OCR
Read Between the Lines: A Benchmark for Uncovering Political Bias in Bangla News Articles
arxiv.org·4h
⚙️Compression Benchmarking
Automating construction safety inspections using a multi-modal vision-language RAG framework
arxiv.org·4h
🤖Advanced OCR
Visual Representations inside the Language Model
arxiv.org·4h
🧮Vector Embeddings