From Documents to Dialogue: A step-by-step RAG Journey
dev.to·12h
📊Multi-vector RAG
The Dunhuang Culture 敦煌文化 Database
digitalorientalist.com·13h
📜Text Collation
Show HN: Rebuilt Bible search app to run 100% client-side with Transformers.js
biblos.app·5h
📜Binary Philology
Welcome to LIL’s Data.gov Archive Search
lil.law.harvard.edu·6h
💾Data Preservation
Creating Real-Time Multimodal AI Pipelines: Scaling File Processing to 50M Daily Uploads
engineering.salesforce.com·3h
🌊Stream Processing
Pecia system
rhollick.wordpress.com·1d
💧Manuscript Watermarks
Markdown2pdf – pure md to pdf transpiler implementation in Rust
github.com·16h
📄PDF Internals
ParsTranslit: Truly Versatile Tajik-Farsi Transliteration
arxiv.org·22h
📜Digital Philology
EagleFiler 1.9.19
tidbits.com·4h
📸TIFF Evolution
MultiPar 1.3.3.5 Beta / 1.3.2.9
scour.ing·19h
🏺ZIP Archaeology
Research Opportunity with Royal Museums Greenwich
dpconline.org·1d
🏺Computational Archaeology
New Articles: Journal of Contemporary Archival Studies
archivespublishing.com·1d
⚖️Archive Ethics
The Best Ways to Digitize Your Notes
lifehacker.com·14h
📄Document Digitization
Few people, many tasks: The minimalist team that powers Vietnamese Wikipedia
diff.wikimedia.org·6h
🌱Personal Wikis
Paper2Agent: Research Papers as Interactive AI Agents
huggingface.co·11h
🤖AI Curation
The artificial complexity of OOXML files (the PPTX case)
blog.documentfoundation.org·10h
📟Terminal Typography
OCR vs ADE: Mechanisms Behind the Methods
dev.to·1d
📄OCR
My First Week of Vibecoding
underreacted.leaflet.pub·17m
🎯Gradual Typing
Announcing the 2025 NDSA Excellence Award Winners
ndsa.org·14h
🏛️PREMIS Metadata
Announcing coreboot 25.09 release
blogs.coreboot.org·3h
🔌Operating system internals