Web Archive Analysis, Internet Archaeology, Crawl Data, Historical Web

Feeds to Scour
SubscribedAll
Scoured 9583 posts in 3.92 s
Natural language processing for word sense disambiguation and information extraction
arxiv.org·14h·
Discuss: r/compsci
📥Feed Aggregation
Preview
Report Post
Kibana: Visualizing Your Data Story
dev.to·9h·
Discuss: DEV
🏺ZIP Archaeology
Preview
Report Post
Used-by: Context aware tech stack recommendations from crawled real world usage
news.ycombinator.com·11h·
Discuss: Hacker News
🎯Content Recommendation
Preview
Report Post
AI Has Made it Easy to Own Your Tools
jimmyhmiller.github.io·1d
🤖Archive Automation
Preview
Report Post
Archaeology — A Tool For Digging Into Binary Files on macOS
mothersruin.com·1h
🏺ZIP Archaeology
Preview
Report Post
faradayio/xsv2: Fork of xsv because qsv is too much
github.com·7h·
Discuss: Hacker News
🗜️LZSS Variants
Preview
Report Post
Tangible Media: A Historical Collection of Information Storage Technology
tangiblemediacollection.com·14h·
Discuss: Hacker News
🏺Media Archaeology
Preview
Report Post
TRUNAJOD: A text complexity library for text analysis built on spaCy — TRUNAJOD 0.1.1 documentation
trunajod20.readthedocs.io·8h
📝Parsing Grammars
Preview
Report Post
Running Code and Failing Models – Rajiv Shah
projects.rajivshah.com·1d
🧠Machine Learning
Preview
Report Post
Anticipating Risk Before It Hits: Surbhi Gupta on the Future of Predictive Analytics
hackernoon.com·2d
🔍BitFunnel
Preview
Report Post
What Technologies Are Running Across 5.5M Websites (Nov 2025)
versiondb.io·16h·
Discuss: Hacker News
🧬Bitstream Evolution
Preview
Report Post
Climate Monitoring Search Engine: Multi-Vectors in Qdrant
pub.towardsai.net
·4d
🗂️Vector Search
Preview
Report Post
I got sick of keeping scraped data up to date, so I built this
meter.sh·1d·
Discuss: Hacker News
🔃Feed Algorithms
Preview
Report Post
Pandas vs Polars: Why the 2025 Evolution Changes Everything
dev.to·7h·
Discuss: DEV
🌀Brotli Internals
Preview
Report Post
Towards Ancient Plant Seed Classification: A Benchmark Dataset and Baseline Model
arxiv.org·4d
🤖Paleographic ML
Preview
Report Post
The History Began from AlexNet: A Comprehensive Survey on Deep LearningApproaches
dev.to·3h·
Discuss: DEV
👁️OCR Evolution
Preview
Report Post
Show HN: BrandRetina – screenshot similarity API for spear-phish detection
brandretina.ai·1d·
Discuss: Hacker News
🔗Binary Similarity
Preview
Report Post
Cluster expansion
reddit.com·3d·
Discuss: r/homelab
🔗Proxmox Clustering
Preview
Report Post
Redis Threading Model: Debunking the Single-Threaded Myth
redis.io·2d·
Discuss: DEV
Redis Internals
Preview
Report Post
aashirpersonal/semantic-coverage: Automated detection of knowledge gaps and blind spots in RAG vector stores.
github.com·3d·
Discuss: Hacker News
🌀Brotli Internals
Preview
Report Post