DeepSeek-OCR + LLama4 + RAG Just Revolutionized Agent OCR Forever
dev.to·1h·
Discuss: DEV
🤖Advanced OCR
Do We Still Need OCR?
pageindex.ai·1d·
🤖Advanced OCR
All Hail The OC71
hackaday.com·1h
🔌Operating system internals
CT-CLIP: A Multi-modal Fusion Framework for Robust Apple Leaf Disease Recognition in Complex Environments
arxiv.org·2d
🤖Advanced OCR
This Chip Computes With Light, Breaking the 10 GHz Barrier for AI
scitechdaily.com·14m
🔬Optical Physics
Scripts That Don’t Fit: The Hidden Bias of NLP in South Asian Languages
digitalorientalist.com·20h
🏛Digital humanities
Display Resolution Calculator
cl.cam.ac.uk·11h·
Discuss: Hacker News
🌈Color Archaeology
Vision-Driven OCR for Long Documents: How Images Compress Text for LLMs
dev.to·1d·
Discuss: DEV
🤖Advanced OCR
Abjad AI at NADI 2025: CATT-Whisper: Multimodal Diacritic Restoration Using Text and Speech Representations
arxiv.org·5h
🎙️Whisper
Reasoning Visual Language Model for Chest X-Ray Analysis
arxiv.org·5h
🏺Computational Archaeology
DeepSeek-OCR: Images Simplify Text for Large Language Models
heise.de·4d
🤖Advanced OCR
DeepOCR – Permanently free, multi-scenario OCR for receipts, docs, handwriting
deepocr.cc·3d·
Discuss: Hacker News
🤖Advanced OCR
Show HN: Front End Fuzzy and Substring and Prefix Search
github.com·3h·
Discuss: Hacker News
🌳Trie Structures
The Art and Science of Counting Pixels: The Math of Figment
fredbenenson.com·1d·
Discuss: Hacker News
📐Mathematical Art
Show HN: WayOfThat, Automatically detect and place fields and checkboxes on PDFs
wayofthat.com·5h·
Discuss: Hacker News
📄Document Digitization
Long-tailed Species Recognition in the NACTI Wildlife Dataset
arxiv.org·2d
🤖Advanced OCR
Morphology-Aware KOA Classification: Integrating Graph Priors with Vision Models
arxiv.org·1d
🌀Riemannian Computing
Accurate and Scalable Multimodal Pathology Retrieval via Attentive Vision-Language Alignment
arxiv.org·1d
🧮Vector Embeddings