Feeds to Scour
SubscribedAll
Scoured 9576 posts in 1.98 s
Natural language processing for word sense disambiguation and information extraction
arxiv.orgยท19hยท
Discuss: r/compsci
๐Ÿ“ฅFeed Aggregation
Preview
Report Post
The art of text (rendering) (39c3)
cdn.media.ccc.deยท12h
๐Ÿ–‹Typography
Preview
Report Post
Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition
dev.toยท15hยท
Discuss: DEV
๐Ÿค–Advanced OCR
Preview
Report Post
mehdigmira/tablereader: Automatically extract clean, typed data from messy Excel and CSV files using LLM-powered table detection.
github.comยท6hยท
Discuss: Hacker News
๐Ÿ“šLempel-Ziv
Preview
Report Post
TRUNAJOD: A text complexity library for text analysis built on spaCy โ€” TRUNAJOD 0.1.1 documentation
trunajod20.readthedocs.ioยท12h
๐Ÿ“Parsing Grammars
Preview
Report Post
Document Parsing with LLMs: From OCR to Structural Understanding.
alamedadev.comยท3d
๐Ÿ“‹Document Grammar
Preview
Report Post
How OCR Impacts the Accuracy of Document Translation
dev.toยท11hยท
Discuss: DEV
โœ๏ธOCR Correction
Preview
Report Post
ngPDF version 2.12.0 is released
duallab.comยท3d
๐Ÿ“„PDF Internals
Preview
Report Post
Building a PDF Ingestion Pipeline with TypeScript, Wasp, and AI OCR
dev.toยท3dยท
Discuss: DEV
๐Ÿ“„Document Streaming
Preview
Report Post
Stanford CS 224N | Natural Language Processing with Deep Learning
web.stanford.eduยท1d
๐Ÿง Machine Learning
Preview
Report Post
CoSeNet: A Novel Approach for Optimal Segmentation of Correlation Matrices
arxiv.orgยท2d
๐Ÿง Machine Learning
Preview
Report Post
AI assisted feature detection and LiDAR in archaeological heritage management [Elektronisk resurs]
libris.kb.seยท5d
๐ŸบComputational Archaeology
Preview
Report Post
leedrake5/unredact: A simple tool for reading in poorly redacted documents and reproducing their origional form
github.comยท4dยท
Discuss: Hacker News
๐Ÿ“„PostScript
Preview
Report Post
<p>**Abstract:** The escalating sophistication of malware necessitates advanced detection techniques beyond signature-based or heuristic approaches. We introduc...
freederia.comยท3d
๐Ÿฆ Malware Analysis
Preview
Report Post
I built a production-ready document parser for RAG apps that actually handles complex tables (full tutorial + code)
dev.toยท1dยท
Discuss: DEV
๐Ÿ“‹Document Grammar
Preview
Report Post
SMART SLM: Structured Memory and Reasoning Transformer, A Small Language Model for Accurate Document Assistance
arxiv.orgยท2d
๐Ÿ“‹Document Grammar
Preview
Report Post
Automating Image Extraction from DOCX Files with Python
dev.toยท5dยท
Discuss: DEV
๐Ÿ“„Document Digitization
Preview
Report Post
MCAP Indexing โ€” Monday Morning Haskell
mmhaskell.comยท5d
๐Ÿ”„Burrows-Wheeler
Preview
Report Post
A case study in PDF forensics: The Epstein PDFs
pdfa.orgยท5dยท
Discuss: Hacker News
๐Ÿ“„PDF Archaeology
Preview
Report Post