OCR vs ADE: Mechanisms Behind the Methods
dev.toยท1dยท
Discuss: DEV
๐Ÿ“„OCR
Show HN: Rebuilt Bible search app to run 100% client-side with Transformers.js
biblos.appยท3hยท
Discuss: Hacker News
๐Ÿ“œBinary Philology
Creating Real-Time Multimodal AI Pipelines: Scaling File Processing to 50M Daily Uploads
engineering.salesforce.comยท1h
๐ŸŒŠStream Processing
The artificial complexity of OOXML files (the PPTX case)
blog.documentfoundation.orgยท7hยท
Discuss: Hacker News
๐Ÿ“ŸTerminal Typography
When mathematics meets aesthetics: Tessellations as a precise tool for solving complex problems
phys.orgยท8h
๐Ÿ“Mathematical Art
The Dunhuang Culture ๆ•ฆ็…Œๆ–‡ๅŒ– Database
digitalorientalist.comยท11h
๐Ÿ“œText Collation
Nearest Neighbor CCP-Based Molecular Sequence Analysis
arxiv.orgยท20h
๐Ÿ”„Burrows-Wheeler
Welcome to LILโ€™s Data.gov Archive Search
lil.law.harvard.eduยท4h
๐Ÿ’พData Preservation
Scientists May Have Decoded the Mysterious Language of a Lost City
popularmechanics.comยท12h
๐Ÿ—๏ธPaleocryptography
Pecia system
rhollick.wordpress.comยท1d
๐Ÿ’งManuscript Watermarks
The Best Ways to Digitize Your Notes
lifehacker.comยท12h
๐Ÿ“„Document Digitization
ScribeOCR โ€“ Web interface for recognizing text, OCR, & creating digitized docs
github.comยท4dยท
Discuss: Hacker News
๐Ÿ“„Document Streaming
From Documents to Dialogue: A step-by-step RAG Journey
dev.toยท10hยท
Discuss: DEV
๐Ÿ“ŠMulti-vector RAG
Evaluating OCR performance on food packaging labels in South Africa
arxiv.orgยท3d
๐Ÿ“„OCR
Announcing the 2025 NDSA Excellence Award Winners
ndsa.orgยท12h
๐Ÿ›๏ธPREMIS Metadata
People rescuing forgotten knowledge trapped on old floppy disks
bbc.comยท12hยท
๐Ÿ“ผCassette Archaeology
Paper2Agent: Research Papers as Interactive AI Agents
huggingface.coยท9hยท
Discuss: Hacker News
๐Ÿค–AI Curation
To MD - Convert PDFs, Word, HTML and more to Markdown
tomd.ioยท17hยท
Discuss: Hacker News
๐Ÿ”„Migration Tools
Advancing Outlook email archiving & Digital Preservation at your organization
preservica.comยท1d
๐Ÿ”„Archival Workflows
IASC: Interactive Agentic System for ConLangs
arxiv.orgยท20h
๐ŸŒณContext free grammars