Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🔤 Character Classification
Unicode Processing, Character Sets, Text Parsing, SMT Applications
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
28263
posts in
32.1
ms
Byte-Pair
Encoding
en.wikipedia.org
·
3h
·
Discuss:
Hacker News
📝
Text Compression
Using AI to Clean Up
OCR
Output
aboutranslation.com
·
11h
·
Discuss:
aboutranslation.com
✏️
OCR Correction
douglas-larocca/name-classifier
: A high-performance name classifier that
infers
probabilistic attributes about a person from their name alone.
github.com
·
17h
·
Discuss:
Hacker News
🧠
Machine Learning
MoDora
: Tree-Based
Semi-Structured
Document Analysis System
arxiv.org
·
2d
🏰
Medieval Parsing
VLM
Validation
and Document Intelligence
pub.towardsai.net
·
1d
✏️
OCR Correction
Unifying
Arabic
topolects
through AI
languagelog.ldc.upenn.edu
·
15h
🤖
AI Translation
Visual
catalog
of
Isotype
examples
flowingdata.com
·
2d
🔤
Font Archaeology
Closing the gap on tabular data with
Fourier
and Implicit
Categorical
Features
arxiv.org
·
2d
🧠
Learned Indexing
Current language model training leaves large
parts
of the internet on the
table
the-decoder.com
·
23h
📄
Text Chunking
Show HN:
AxonML
– A
PyTorch-equivalent
ML framework written in Rust
github.com
·
12h
·
Discuss:
Hacker News
🌀
Brotli Internals
Meet
M6
: The Chinese AI That
Understands
Text and Images at Scale
hackernoon.com
·
20h
🇨🇳
Chinese Computing
FireRedTeam/FireRed-OCR
huggingface.co
·
4h
📄
OCR
STRinGS
: Selective Text
Refinement
in Gaussian Splatting
strings-official.github.io
·
2d
·
Discuss:
Hacker News
🤖
Advanced OCR
Dictionary
of Algorithms and Data
Structures
xlinux.nist.gov
·
3d
·
Discuss:
Lobsters
🌳
Trie Structures
Swapping
NULL for
NUL
- a better way to find nothing!
research.exoticsilicon.com
·
1d
🧠
Lisp Dialects
The Chinese Computer:
Competition
or
Cooperation
?
languagelog.ldc.upenn.edu
·
5h
🇨🇳
Chinese Computing
What is a
token
jenniferplusplus.com
·
3d
📝
Text Parsing
Cute-Symbol
– A lightweight, zero-ad character
picker
for developers
cute-symbol.com
·
1d
·
Discuss:
Hacker News
🔠
Terminal Fonts
Building
Production-Grade
RAG Systems for
Document
AI: What It Actually Takes
hackernoon.com
·
1d
📄
Document AI
Laravel
OCR
& Document Data
Extractor
- A powerful
OCR
and document parsing engine for Laravel
packagist.org
·
1d
·
Discuss:
r/SideProject
,
r/programming
📜
Manuscript Digitization
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help