Paleographic Recognition, Historical OCR, Deep Learning, Digital Humanities

OCR vs ADE: Mechanisms Behind the Methods
dev.to·1d·
Discuss: DEV
📄OCR
Show HN: Rebuilt Bible search app to run 100% client-side with Transformers.js
biblos.app·43m·
Discuss: Hacker News
📜Binary Philology
Welcome to LIL’s Data.gov Archive Search
lil.law.harvard.edu·2h
💾Data Preservation
Pecia system
rhollick.wordpress.com·1d
💧Manuscript Watermarks
IASC: Interactive Agentic System for ConLangs
arxiv.org·18h
🌳Context free grammars
Paper2Agent: Research Papers as Interactive AI Agents
huggingface.co·6h·
Discuss: Hacker News
🤖AI Curation
The Dunhuang Culture 敦煌文化 Database
digitalorientalist.com·9h
📜Text Collation
The Best Ways to Digitize Your Notes
lifehacker.com·9h
📄Document Digitization
YouTube gets ~5% CTR lift on Shorts by replacing embedding tables with Semantic IDs
shaped.ai·22h
📊Feed Optimization
Show HN: Lore Engine – Turn 10-hour lectures into 2 hours of comprehensive notes
github.com·1d·
Discuss: Hacker News
📄Document Streaming
Unlocking Image Understanding: A New Path to Visual AI for Everyone
dev.to·1d·
Discuss: DEV
🤖AI Paleography
From Documents to Dialogue: A step-by-step RAG Journey
dev.to·8h·
Discuss: DEV
📊Multi-vector RAG
RND1: Simple, Scalable AR-to-Diffusion Conversion
radicalnumerics.ai·1d·
Discuss: Hacker News
💻Local LLMs
Computable Babylonian Diaries Project
christopherwolfram.com·7h·
Discuss: Hacker News
📜Digital Philology
Efficient and accurate search in petabase-scale sequence repositories
nature.com·2d·
Discuss: Hacker News
🔄Burrows-Wheeler
The key to conversational speech recognition
datasciencecentral.com·1d
🎵Audio ML
The rapidly evolving field of artificial intelligence has le
dev.to·6h·
Discuss: DEV
🧭Content Discovery
The artificial complexity of OOXML files (the PPTX case)
blog.documentfoundation.org·5h·
Discuss: Hacker News
📟Terminal Typography
In-Depth Analysis: "Attention Is All You Need"
dev.to·6h·
Discuss: DEV
🧠Intelligence Compression