Document AI

Transformer Models, Layout Analysis, OCR Enhancement, Information Extraction

Feeds to Scour
SubscribedAll
Scoured 29 posts in 9.4 ms

Building TESSERACT-X: An AI-Powered 4D Simulation Engine in the Browser

 🤖Advanced OCR  Content type: Video
youtu.be··DEV

Tesseract Echo is a FREE multi-tap tape delay and reverb plugin

 📼Cassette Technology  Content type: Blog

HAIKU FROM THE HEART ZINE : Posthuman Auntie : Free Download, Borrow, and Streaming

 📝Punctuation Parsing
archive.org·

Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents

 📋Document Analysis  Content type: Academic
arxiv.org·

How I built an offline, privacy-first receipt scanner using Rust, Tauri, and WebAssembly OCR

 📄Document Digitization  Content type: Code
github.com··DEV

PyCoder’s Weekly: Issue #738: sleep(), Polars Workflows, Iterators, and More (2026-06-09)

 💻programming languages
pycoders.com·

Design a Knowledge Q&A System

 📄Text Chunking

Bitcoin's $63K Reclaim Liquidates $540M in Crypto Shorts, a 7-Week High

 🤖Advanced OCR  Content type: News
decrypt.co·

Show HN: Open Terminal – A Bloomberg Style App for Research

 🤖Advanced OCR

Build a Medical Report Analyzer on Dedicated Inference with Python

 🤖Advanced OCR
digitalocean.com·

Can LLMs extract scientific consensus? A case study in high-temperature superconductivity

 📄Text Mining  Content type: Academic
arxiv.org·

ferdinandobons/brand-docs: Turn a company’s Word, PowerPoint or Excel template into unlimited on-brand documents. Open-source Claude Code skill bundle that extracts a reusable Brand Profile and generates faithful .docx/.pptx/.xlsx — off-brand output impossible by construction.

 🤖Advanced OCR  Content type: Code

How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions

 📄Text Mining  Content type: Academic
arxiv.org·

SMADE-IE: Sparse Multi-Agent Framework with Evidence-Driven Debate for Zero-Shot Information Extraction

 📄Text Mining  Content type: Academic
arxiv.org·

ThomasRooyakkers/HomeHub: HomeHub is a personal home-management app designed for single-household use. It combines everyday household tools into one clean dashboard and can be run locally or self-hosted with Docker.

 🏠HomeLab  Content type: Code
github.com··r/SideProject

PereStruct: Multimodal Semantic Assembly for Robust Historical Document Parsing

 📋Document Analysis  Content type: Academic
arxiv.org·

Stress-testing medical large language models reveals latent safety pathology beyond benchmark accuracy

 📄Text Mining  Content type: Academic
arxiv.org·

End-to-End Text Line Detection and Ordering

 🤖Advanced OCR  Content type: Academic
arxiv.org·

On the Shoulders of Giants: Empowering Automated Smart Contract Auditing via the GiAnt Corpus

 📄Text Mining  Content type: Academic
arxiv.org·

QO-Bench: Diagnosing Query-Operator-Preserving Retrieval over Typed Event Tuples

 📄Text Chunking  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help