From Documents to Dialogue: A step-by-step RAG Journey
dev.toยท8hยท
Discuss: DEV
๐Ÿ“ŠMulti-vector RAG
The Dunhuang Culture ๆ•ฆ็…Œๆ–‡ๅŒ– Database
digitalorientalist.comยท9h
๐Ÿ“œText Collation
Show HN: Rebuilt Bible search app to run 100% client-side with Transformers.js
biblos.appยท52mยท
Discuss: Hacker News
๐Ÿ“œBinary Philology
Welcome to LILโ€™s Data.gov Archive Search
lil.law.harvard.eduยท2h
๐Ÿ’พData Preservation
Pecia system
rhollick.wordpress.comยท1d
๐Ÿ’งManuscript Watermarks
Markdown2pdf โ€“ pure md to pdf transpiler implementation in Rust
github.comยท11hยท
Discuss: Hacker News
๐Ÿ“„PDF Internals
ParsTranslit: Truly Versatile Tajik-Farsi Transliteration
arxiv.orgยท18h
๐Ÿ“œDigital Philology
MultiPar 1.3.3.5 Beta / 1.3.2.9
scour.ingยท14h
๐ŸบZIP Archaeology
Research Opportunity with Royal Museums Greenwich
dpconline.orgยท1d
๐ŸบComputational Archaeology
New Articles: Journal of Contemporary Archival Studies
archivespublishing.comยท1d
โš–๏ธArchive Ethics
The Best Ways to Digitize Your Notes
lifehacker.comยท9h
๐Ÿ“„Document Digitization
Few people, many tasks: The minimalist team that powers Vietnamese Wikipedia
diff.wikimedia.orgยท1h
๐ŸŒฑPersonal Wikis
Paper2Agent: Research Papers as Interactive AI Agents
huggingface.coยท6hยท
Discuss: Hacker News
๐Ÿค–AI Curation
The artificial complexity of OOXML files (the PPTX case)
blog.documentfoundation.orgยท5hยท
Discuss: Hacker News
๐Ÿ“ŸTerminal Typography
OCR vs ADE: Mechanisms Behind the Methods
dev.toยท1dยท
Discuss: DEV
๐Ÿ“„OCR
Announcing the 2025 NDSA Excellence Award Winners
ndsa.orgยท9h
๐Ÿ›๏ธPREMIS Metadata
Title TBA
usenix.orgยท16h
๐Ÿ“„PostScript
October 10, 2025: Bellingcat Online Workshop on RuNet Investigations (4-hour) [Americas / Europe-friendly time]
bellingcat.comยท22h
๐Ÿ”Polish Cryptanalysis