From Documents to Dialogue: A step-by-step RAG Journey
dev.to·18h·
Discuss: DEV
📊Multi-vector RAG
New Articles: Journal of Contemporary Archival Studies
archivespublishing.com·1d
⚖️Archive Ethics
Show HN: Rebuilt Bible search app to run 100% client-side with Transformers.js
biblos.app·10h·
Discuss: Hacker News
📜Binary Philology
Few people, many tasks: The minimalist team that powers Vietnamese Wikipedia
diff.wikimedia.org·11h
🌱Personal Wikis
Welcome to LIL’s Data.gov Archive Search
lil.law.harvard.edu·12h
💾Data Preservation
My first homelab project!
i.redd.it·1d·
Discuss: r/homelab
🏠Homelab
Paper2Agent: Research Papers as Interactive AI Agents
huggingface.co·16h·
Discuss: Hacker News
🤖AI Curation
Creating Real-Time Multimodal AI Pipelines: Scaling File Processing to 50M Daily Uploads
engineering.salesforce.com·8h
🌊Stream Processing
Announcing the 2025 NDSA Excellence Award Winners
ndsa.org·19h
🏛️PREMIS Metadata
Sales pitch about why you should learn statistics
minireference.com·15h
🧠Intelligence Compression
2025-10-10: An Internship Experience With the Internet Archive as a Google Summer of Code Contributor
ws-dl.blogspot.com·9h·
🔓Open Source Software
Unlocking Faster Insights with Experimenter-Defined Segmentations
etsy.com·2d
📝Document Chunking
Offensive OSINT s05e10 - Interactive investigative stories part 1
offensiveosint.io·2d
🌐WARC Forensics
The artificial complexity of OOXML files (the PPTX case)
blog.documentfoundation.org·15h·
Discuss: Hacker News
📟Terminal Typography
Data Management for Collaborations
dataabinitio.com·4d
📦METS Containers
A gentle introduction to Generative AI: Historical perspective
medium.com·7h·
Discuss: Hacker News
🧠Learned Codecs
The worst research papers I’ve ever published
statmodeling.stat.columbia.edu·1d
🧮Kolmogorov Bounds
Complex networks-based anomaly detection for financial transactions in anti-money laundering
sciencedirect.com·2d
🧬PostgreSQL Forensics
LINQ and Learning to Be Declarative
nickstambaugh.dev·1d·
Discuss: Hacker News
🔗Concatenative Programming