From Documents to Dialogue: A step-by-step RAG Journey
dev.to·12h·
Discuss: DEV
📊Multi-vector RAG
New Articles: Journal of Contemporary Archival Studies
archivespublishing.com·1d
⚖️Archive Ethics
Show HN: Rebuilt Bible search app to run 100% client-side with Transformers.js
biblos.app·5h·
Discuss: Hacker News
📜Binary Philology
Few people, many tasks: The minimalist team that powers Vietnamese Wikipedia
diff.wikimedia.org·6h
🌱Personal Wikis
From Keywords to Clusters: AI-Driven Analysis of YouTube Comments to Reveal Election Issue Salience in 2024
arxiv.org·22h
📊Feed Optimization
Welcome to LIL’s Data.gov Archive Search
lil.law.harvard.edu·6h
💾Data Preservation
My first homelab project!
i.redd.it·1d·
Discuss: r/homelab
🏠Homelab
Paper2Agent: Research Papers as Interactive AI Agents
huggingface.co·10h·
Discuss: Hacker News
🤖AI Curation
Creating Real-Time Multimodal AI Pipelines: Scaling File Processing to 50M Daily Uploads
engineering.salesforce.com·2h
🌊Stream Processing
Announcing the 2025 NDSA Excellence Award Winners
ndsa.org·14h
🏛️PREMIS Metadata
Sales pitch about why you should learn statistics
minireference.com·9h
🧠Intelligence Compression
2025-10-10: An Internship Experience With the Internet Archive as a Google Summer of Code Contributor
ws-dl.blogspot.com·4h·
🔓Open Source Software
The Dunhuang Culture 敦煌文化 Database
digitalorientalist.com·13h
📜Text Collation
Unlocking Faster Insights with Experimenter-Defined Segmentations
etsy.com·2d
📝Document Chunking
Offensive OSINT s05e10 - Interactive investigative stories part 1
offensiveosint.io·2d
🌐WARC Forensics
The artificial complexity of OOXML files (the PPTX case)
blog.documentfoundation.org·9h·
Discuss: Hacker News
📟Terminal Typography
Data Management for Collaborations
dataabinitio.com·4d
📦METS Containers
A gentle introduction to Generative AI: Historical perspective
medium.com·1h·
Discuss: Hacker News
🧠Learned Codecs
The worst research papers I’ve ever published
statmodeling.stat.columbia.edu·1d
🧮Kolmogorov Bounds
Complex networks-based anomaly detection for financial transactions in anti-money laundering
sciencedirect.com·2d
🧬PostgreSQL Forensics