Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
📄 Document Processing
Text Extraction, PDF Parsing, OCR, Data Ingestion
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
83091
posts in
245.3
ms
Precise Text and
Tabular
Data Extraction from
PDFs
in Python
dev.to
·
1d
·
Discuss:
DEV
🔍
RAG
How to Build a
Document
Processing Pipeline for RAG with
Nemotron
developer.nvidia.com
·
2d
🔍
RAG
How
Associa
transforms document classification with the GenAI
IDP
Accelerator and Amazon Bedrock
aws.amazon.com
·
1d
📊
Vector Databases
baottdang/semantic-doc-search-engine
: A cross‑modal search engine for
PDFs
and images, powered by a CNN‑based feature extraction pipeline.
github.com
·
1d
·
Discuss:
r/SideProject
🔍
RAG
What is
OCR
? (And 4 Real-World Use
Cases
)
dev.to
·
1d
·
Discuss:
DEV
📊
Vector Databases
OmniAI
OCR
Benchmark
getomni.ai
·
1d
📊
Vector Databases
Dolphin-v2
: Universal Document
Parsing
via Scalable Anchor Prompting
arxiv.org
·
1d
📊
Vector Databases
Best Data
Extraction
Software of 2026
theaisurf.com
·
2d
🔍
RAG
Nemotron
Labs
: How AI Agents Are Turning Documents Into Real-Time Business Intelligence
blogs.nvidia.com
·
2d
🤖
Ai
**Abstract:** This paper introduces Hyperdimensional Semantic Alignment for Ancient Text Restoration and
Contextualization
(
HASATRC
), a novel framework lever...
freederia.com
·
2d
🔍
RAG
Show HN: The
Librarian
creates artifacts while a model
finishes
its work
librarian.fieldtheory.dev
·
11h
·
Discuss:
Hacker News
🤖
Ai
Text classification with Python 3.14's
zstd
module • Max
Halford
maxhalford.github.io
·
1d
·
Discuss:
Lobsters
,
Hacker News
💬
Large Language Models
OpenAI and
Ginkgo
Bioworks
build an autonomous lab where GPT-5 calls the shots
the-decoder.com
·
16h
🤖
Ai
How
Grab
Built a Vision LLM to
Scan
Images
blog.bytebytego.com
·
3d
🔍
RAG
Google
reclassifies
as a Data Processor for
reCAPTCHA
thomasrigby.com
·
12h
🔍
RAG
Recreating Epstein PDFs from raw
encoded
attachments
neosmart.net
·
2d
·
Discuss:
Lobsters
,
Hacker News
🔍
RAG
Three
Investigative
Bottlenecks
– Three New Baseline Capabilities
forensicfocus.com
·
18h
🔍
RAG
New England
Biolabs
® Proudly Partners with Automation Providers to Support
NGS
Library Prep Automation Globally
prnewswire.com
·
14h
🤖
Ai
Google's
PaperBanana
: AI agent beats PhD experts at scientific
diagrams
ppc.land
·
13h
🤖
Ai
EgyPLI
: A Real-life
Annotated
Image Dataset for Egyptian Plant Leaf Identification
nature.com
·
19h
📊
Vector Databases
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help