Extracting A Large Corpus from the Internet Archive, A Case Study
journal.code4lib.org·1d
🏺ZIP Archaeology
Flag this post
[Project] termiNAS: Self-hosted storage server with ransomware protection via Btrfs snapshots (Alpha)
reddit.com·3h·
Discuss: r/homelab
🔄Sync Engine
Flag this post
The Wayback Machine’s snapshots of news homepages plummet after a “breakdown” in archiving projects
niemanlab.org·1d·
Discuss: Hacker News
🌐Web Archives
Flag this post
Public Domain Software Libraries
goto10retro.com·9h
📼Cassette Archaeology
Flag this post
HTTPoetics Reflection
campuspress.yale.edu·13h
🏛Digital humanities
Flag this post
An Artefactual first: Double Release
artefactual.com·1d
🤖Archive Automation
Flag this post
Milestones in the History of PDF
pdfa.org·6h
📄PDF Archaeology
Flag this post
Retrieval-Augmented Generation for Web Archives: A Comparative Study of WARC-GPT and a Custom Pipeline
journal.code4lib.org·1d
🌐WARC Mining
Flag this post
Privacy Is Broken in Everyday Tools — But the Browser Can Fix It
hackernoon.com·20h
🛡️WebAssembly Security
Flag this post
Tool Demonstrations: The Great Digital Preservation Bake Off at iPRES 2025
ipres2025.nz·7h
📼Cassette Archaeology
Flag this post
AIxCC curl details
daniel.haxx.se·18h·
Discuss: Hacker News
🛡️WASM Sandboxing
Flag this post
TOLLBOOTH: What's yours, IIS mine
elastic.co·1d
🛡️eBPF Security
Flag this post
The FAIR Guiding Principles for scientific data management and stewardship
nature.com·1d·
Discuss: Hacker News
🏷️Metadata Standards
Flag this post
Web Archive 96: How the Smithsonian Helped Create One of the First Wayback Machine Collections
blog.archive.org·2d
💾Data Preservation
Flag this post
Reflecting on No Time To Wait 9: Common Questions for the Future of Open and Collaborative…
digitalpreservation-blog.lib.cam.ac.uk·1d
🏛️PREMIS
Flag this post
OpenAI Launches ChatGPT Atlas
pxlnv.com·21h
🔌Archive APIs
Flag this post
🎥 I Built a Professional YouTube Downloader with Python - Here's How!
dev.to·8h·
Discuss: DEV
🏺ZIP Archaeology
Flag this post
What it Means to be a Repository: Real, Trustworthy, or Mature?
journal.code4lib.org·1d
🏛️OAIS Implementation
Flag this post
ClapperText: A Benchmark for Text Recognition in Low-Resource Archival Documents
arxiv.org·2d
📄OCR
Flag this post