Document AI

Transformer Models, Layout Analysis, OCR Enhancement, Information Extraction

Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents

 📋Document Analysis  Content type: Academic
arxiv.org

PdfPig C# Review: A Focused Open-Source PDF Library in 2026

 📋Document Layout
hackernoon.com

Rpdfium: Ruby implementation of Pdfium, Chrome's PDF engine

 🖋Typography  Content type: Code
github.com · Hacker News

Show HN: Open Terminal – A Bloomberg Style App for Research

 🤖Advanced OCR

Field Notes From The AI Battlefield

 📊Static Analysis
taoofmac.com · Hacker News

How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions

 📄Text Mining  Content type: Academic
arxiv.org

Stress-testing medical large language models reveals latent safety pathology beyond benchmark accuracy

 📄Text Mining  Content type: Academic
arxiv.org

SMADE-IE: Sparse Multi-Agent Framework with Evidence-Driven Debate for Zero-Shot Information Extraction

 📄Text Mining  Content type: Academic
arxiv.org

On the Shoulders of Giants: Empowering Automated Smart Contract Auditing via the GiAnt Corpus

 📄Text Mining  Content type: Academic
arxiv.org

End-to-End Text Line Detection and Ordering

 🤖Advanced OCR  Content type: Academic
arxiv.org

QO-Bench: Diagnosing Query-Operator-Preserving Retrieval over Typed Event Tuples

 📄Text Chunking  Content type: Academic
arxiv.org
