Document Digitization

Feeds to Scour
SubscribedAll
Scoured 37 posts in 18.2 ms

alibaba/open-code-review: Battle-tested at Alibaba's scale. Hybrid architecture code review tool: deterministic pipelines + LLM Agent, precise line-level comments, built-in fine-tuned ruleset (NPE, thread-safety, XSS, SQL injection), OpenAI & Anthropic compatible.

 📝Punctuation Parsing  Content type: Code
github.com··Hacker News

Using Apple native OCR to turn recurring workflows into skills [video]

 📝Punctuation Parsing  Content type: Video
youtube.com··Hacker News

RealDocBench: A Benchmark for Field-Level QA and Layout Understanding on Real-World Regulated Documents

 📝Punctuation Parsing  Content type: Academic
arxiv.org·

Read vehicle license plates this API gives you 2,500 free reads per month

 📝Punctuation Parsing

The `pdf` Claude Code Skill: Create, Fill, and Manipulate PDFs with AI

 🤖Advanced OCR  Content type: Blog
jonathansblog.co.uk·

Unix World Vol02.10 : Unix World : Free Download, Borrow, and Streaming

 📄OCR  Content type: PDF
archive.org··Hacker News

New Issue: Collections

 🔄Archival Workflows

A Practical Security Architecture for Retrieval-Augmented Generation

 📄Semantic Chunking
hackernoon.com·

Preservica takes AI for Digital Preservation to the next level with powerful new AI Editions

 ❄️Nordic Preservation
preservica.com·

REST API for Parsing Swiss QR-Bill PDFs

 📝Punctuation Parsing
billscan.ch··Hacker News

Trump’s justice department is weaponizing civil rights laws against students of color | ReNika Moore

 📝Punctuation Parsing  Content type: News
theguardian.com·

Show HN: A beautiful and local-first PDF reader for studying dense things

 🔌Offline-first Apps

TeamHerald@CHIPSAL 2026: Hate Speech Detection and Sentiment Analysis of Nepali Memes using Transformer-based Architectures and Ensemble Learning

 📝Punctuation Parsing  Content type: Academic
arxiv.org·

POTATR: A Lightweight Image-to-Graph Model for Page-Level Table Extraction

 🌳B+ Tree Splits  Content type: Academic
arxiv.org·

MUDIDI: A Two-Stage Framework for Multilingual Dictionary Digitization with Language Models

 📝Punctuation Parsing  Content type: Academic
arxiv.org·

Handwriting Extraction and Analysis of Signature Lists in Swiss Popular Initiatives

 👁️OCR Verification  Content type: Academic
arxiv.org·

Vision Language Model Helps Private Information De-Identification in Vision Data

 📄OCR  Content type: Academic
arxiv.org·

Baichuan-M4: A Clinical-Grade Medical Agent System for Continuous Care

 📝Punctuation Parsing  Content type: Academic
arxiv.org·

Real-Time Automatic License Plate Recognition Using YOLOv8, SORT Tracking, and Temporal Data Interpolation

 📄OCR  Content type: Academic
arxiv.org·

Multimodal Sexism Identification and Characterization using Large Language Models and Gradient Boosting

 🧪Data science  Content type: Academic
arxiv.org·

No more posts from matmat's subscribed feeds.

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help