Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
๐ท๏ธ Web Scraping
Data Extraction, HTML Parsing, Automated Collection, Web Crawling
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112089
posts in
518.6
ms
Efficient
Crawling
for Scalable Web Data Acquisition (
Extended
Version)
arxiv.org
ยท
14h
๐
Feed Algorithms
Easily Harvest (
Scrape
) Web Pages โข
rvest
rvest.tidyverse.org
ยท
1d
๐
Feed Autodiscovery
A
general-purpose
library to add
Webmentions
to your website
lemmy.world
ยท
10h
๐
IndieWeb
Poisoning
scraperbots
with
iocaine
lwn.net
ยท
20h
ยท
Discuss:
Hacker News
๐
Indie Hacking
Extract
structured data from any website. Real-time search API with JSON,
Markdown
& HTML output.
searchresult.dev
ยท
3d
ยท
Discuss:
Hacker News
๐
Information Retrieval
Unleash
your ideas with
ASCII
monosketch.io
ยท
6h
ยท
Discuss:
Hacker News
๐ฅถ
Cold Start Problem
Love at First Data Point: Interactive Data Collection (
Stony
Brook
University Libraries)
rbfirehose.com
ยท
6h
๐
Digital Humanities
Building a PDF
Structured
Data
Extractor
with Python
dev.to
ยท
2h
ยท
Discuss:
DEV
๐ฐ
Feed Readers
jolovicdev/sourcery
: Schema-first LLM extraction framework with entity grounding, multi-pass extraction, and deterministic post-processing
github.com
ยท
20h
ยท
Discuss:
Hacker News
โก
Static Site Generators
Analytical
Search
arxiv.org
ยท
14h
๐
Information Retrieval
Turn natural language into
E2E
web app
tests
using AI
agenqa.com
ยท
6h
ยท
Discuss:
Hacker News
๐ฌ
Prompt Engineering
Show HN: AI-Powered
Structured
Data
Extraction
from Any Document (93%+ Accuracy)
news.ycombinator.com
ยท
1d
ยท
Discuss:
Hacker News
๐
TextRank
Addendum
: Data splitting against information leakage with
DataSAIL
nature.com
ยท
6h
๐ข
Kolmogorov Complexity
Automating
Codex
build.ms
ยท
1d
๐
Code Review
2026-02-13: Paper Summary: "High Fidelity Web
Archiving
of News Sites and New Media with
Browsertrix
"
ws-dl.blogspot.com
ยท
1h
ยท
Discuss:
ws-dl.blogspot.com
๐
Digital Archiving
ever
wonder
how
engines
work?
iwebthings.joejenett.com
ยท
2h
๐
Personal Wikis
Show HN:
WavNav
, a desktop app to explore and search large
sample
libraries
maxgraf.space
ยท
5h
ยท
Discuss:
Hacker News
๐
Digital Humanities
Stop Silent Failures: Using LLMs to
Validate
Web
Scraper
Output
dev.to
ยท
4d
ยท
Discuss:
DEV
๐
Feed Autodiscovery
Projects
drehmflight.com
ยท
3h
๐ป
Creative Coding
R. S.
Doiel
, Software
Engineer/Analyst
rsdoiel.github.io
ยท
15m
๐
Information Retrieval
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help