WorldCat Editions and Holdings Release
annas-archive.orgยท22hยท
Discuss: Hacker News
๐Ÿ“šMARC Records
The Rise of Semantic Entity Resolution
towardsdatascience.comยท21h
๐Ÿ“„Semantic Chunking
UTF-8 Is Beautiful
hackaday.comยท8h
๐Ÿ”ฃUnicode
Preserving the digital legacy of company archives: Last stop, Newhaven.
dpconline.orgยท5h
๐Ÿ’พData Preservation
Ancient Scripts, Modern AI: Bridging the Divide with Morphology-Aware Tokenization by Arvind Sundararajan
dev.toยท1dยท
Discuss: DEV
๐Ÿ“Concrete Syntax
From Legal Documents to Knowledge Graphs
neo4j.comยท2dยท
Discuss: Hacker News
๐Ÿ“‹Document Grammar
Sindhi Halchal Archive: Building on the PG Sindhi Library
digitalorientalist.comยท3d
๐ŸŒWeb Archiving
Satyajit Das: On Reading โ€“ Textual Pleasures
nakedcapitalism.comยท2d
๐Ÿ“•Bookbinding
Lessons from using AI in Discovery
thoughtbot.comยท13h
๐Ÿ•ต๏ธMetadata Mining
Vibe Graveyard
vibegraveyard.aiยท3h
๐Ÿ“ผCassette Culture
Show HN: I Built a Free Site for Students to Test Their Knowledge on Their Notes
pdftoquiz.comยท22hยท
Discuss: Hacker News
๐Ÿ“„PDF Archaeology
Listening to Unreliable Narrators
secondvoice.substack.comยท2hยท
Discuss: Substack
๐ŸฐManuscript Networks
LLM Rerankers for RAG: A Practical Guide
fin.aiยท16hยท
๐Ÿ”Information Retrieval
Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.comยท9hยท
๐ŸงฎKolmogorov Complexity
Kindred and Co-located Events: PARBICA 21 Demystifying Digital
ipres2025.nzยท11h
๐Ÿ›๏ธNordic Archives
Cliodynamics โ€“ History as Science
peterturchin.comยท3dยท
Discuss: Hacker News
๐Ÿ“œManuscript Calculus
Text-to-SQL Oriented to the Process Mining Domain: A PT-EN Dataset for Query Translation
arxiv.orgยท9h
๐Ÿ“‹Document Grammar
ALIGNS: Unlocking nomological networks in psychological measurement through a large language model
arxiv.orgยท9h
๐Ÿง Intelligence Compression
How to Train an LLM-Recommender Hybrid that Speaks English & Item IDs
eugeneyan.comยท1d
๐Ÿ”Information Retrieval