Extracting A Large Corpus from the Internet Archive, A Case Study
journal.code4lib.org·1d
🏺ZIP Archaeology
Flag this post
Safeguarding the Past for the Future: International Training on Digitisation, Preservation, and Emergency Response
ica.org·7h
💾Data Preservation
Flag this post
[Project] termiNAS: Self-hosted storage server with ransomware protection via Btrfs snapshots (Alpha)
🔄Sync Engine
Flag this post
The Wayback Machine’s snapshots of news homepages plummet after a “breakdown” in archiving projects
🌐Web Archives
Flag this post
Public Domain Software Libraries
goto10retro.com·9h
📼Cassette Archaeology
Flag this post
HTTPoetics Reflection
campuspress.yale.edu·13h
🏛Digital humanities
Flag this post
An Artefactual first: Double Release
artefactual.com·1d
🤖Archive Automation
Flag this post
Milestones in the History of PDF
pdfa.org·6h
📄PDF Archaeology
Flag this post
Retrieval-Augmented Generation for Web Archives: A Comparative Study of WARC-GPT and a Custom Pipeline
journal.code4lib.org·1d
🌐WARC Mining
Flag this post
Privacy Is Broken in Everyday Tools — But the Browser Can Fix It
hackernoon.com·20h
🛡️WebAssembly Security
Flag this post
Tool Demonstrations: The Great Digital Preservation Bake Off at iPRES 2025
ipres2025.nz·7h
📼Cassette Archaeology
Flag this post
AIxCC curl details
🛡️WASM Sandboxing
Flag this post
TOLLBOOTH: What's yours, IIS mine
elastic.co·1d
🛡️eBPF Security
Flag this post
The FAIR Guiding Principles for scientific data management and stewardship
🏷️Metadata Standards
Flag this post
Web Archive 96: How the Smithsonian Helped Create One of the First Wayback Machine Collections
blog.archive.org·2d
💾Data Preservation
Flag this post
Reflecting on No Time To Wait 9: Common Questions for the Future of Open and Collaborative…
digitalpreservation-blog.lib.cam.ac.uk·1d
🏛️PREMIS
Flag this post
OpenAI Launches ChatGPT Atlas
pxlnv.com·21h
🔌Archive APIs
Flag this post
What it Means to be a Repository: Real, Trustworthy, or Mature?
journal.code4lib.org·1d
🏛️OAIS Implementation
Flag this post
ClapperText: A Benchmark for Text Recognition in Low-Resource Archival Documents
arxiv.org·2d
📄OCR
Flag this post
Loading...Loading more...