OCR vs ADE: Mechanisms Behind the Methods
dev.toยท1dยท
Discuss: DEV
๐Ÿ“„OCR
Show HN: Rebuilt Bible search app to run 100% client-side with Transformers.js
biblos.appยท43mยท
Discuss: Hacker News
๐Ÿ“œBinary Philology
The artificial complexity of OOXML files (the PPTX case)
blog.documentfoundation.orgยท5hยท
Discuss: Hacker News
๐Ÿ“ŸTerminal Typography
Physics-informed AI excels at large-scale discovery of new materials
phys.orgยท6h
๐Ÿง Machine Learning
The Dunhuang Culture ๆ•ฆ็…Œๆ–‡ๅŒ– Database
digitalorientalist.comยท9h
๐Ÿ“œText Collation
Nearest Neighbor CCP-Based Molecular Sequence Analysis
arxiv.orgยท18h
๐Ÿ”„Burrows-Wheeler
Welcome to LILโ€™s Data.gov Archive Search
lil.law.harvard.eduยท2h
๐Ÿ’พData Preservation
Scientists May Have Decoded the Mysterious Language of a Lost City
popularmechanics.comยท9h
๐Ÿ—๏ธPaleocryptography
Pecia system
rhollick.wordpress.comยท1d
๐Ÿ’งManuscript Watermarks
The Best Ways to Digitize Your Notes
lifehacker.comยท9h
๐Ÿ“„Document Digitization
ScribeOCR โ€“ Web interface for recognizing text, OCR, & creating digitized docs
github.comยท4dยท
Discuss: Hacker News
๐Ÿ“„Document Streaming
Advancing Outlook email archiving & Digital Preservation at your organization
preservica.comยท1d
๐Ÿ”„Archival Workflows
From Documents to Dialogue: A step-by-step RAG Journey
dev.toยท8hยท
Discuss: DEV
๐Ÿ“ŠMulti-vector RAG
Evaluating OCR performance on food packaging labels in South Africa
arxiv.orgยท3d
๐Ÿ“„OCR
Announcing the 2025 NDSA Excellence Award Winners
ndsa.orgยท9h
๐Ÿ›๏ธPREMIS Metadata
People rescuing forgotten knowledge trapped on old floppy disks
bbc.comยท9hยท
Discuss: Hacker News
๐Ÿ“ผCassette Archaeology
GPT-5 for AI-assisted discovery
johndcook.comยท7h
๐ŸŽฏPerformance Proofs
Paper2Agent: Research Papers as Interactive AI Agents
huggingface.coยท6hยท
Discuss: Hacker News
๐Ÿค–AI Curation
To MD - Convert PDFs, Word, HTML and more to Markdown
tomd.ioยท14hยท
Discuss: Hacker News
๐Ÿ”„Migration Tools
[R] A Unified Framework for Continual Semantic Segmentation in 2D and 3D Domains
reddit.comยท17hยท
๐Ÿ“Document Chunking