Fine-Grained Detection of AI-Generated Text Using Sentence-Level Segmentation
arxiv.org·5h
📄Document AI
What is NLP? How Does it Work?
dev.to·1d·
Discuss: DEV
Text Parsing
Behavioral Validity Checks for ML-Based "Coding"
gojiberries.io·3h·
Discuss: Hacker News
🧠Intelligence Compression
Open Political Corpora: Structuring, Searching, and Analyzing Political Text Collections with PoliCorp
arxiv.org·5h
📄Semantic Chunking
Accurate Thyroid Cancer Classification using a Novel Binary Pattern Driven Local Discrete Cosine Transform Descriptor
arxiv.org·5h
📄OCR
Real, Fake, or Manipulated? Detecting Machine-Influenced Text
arxiv.org·1d
🔤Character Classification
HARE: an entity and relation centric evaluation framework for histopathology reports
arxiv.org·5h
⚙️Compression Benchmarking
Graph Harmony: Harmonizing Global and Local Views for Superior Clustering
dev.to·1d·
Discuss: DEV
🌊Spectral Clustering
Extending Automatic Machine Translation Evaluation to Book-Length Documents
arxiv.org·5h
⚙️Compression Benchmarking
Why Tables Are the Hardest Problem in Document AI
runpulse.com·19h·
Discuss: Hacker News
📄Document AI
DragOSM: Extract Building Roofs and Footprints from Aerial Images by Aligning Historical Labels
arxiv.org·5h
🔶Voronoi Diagrams
DocIQ: A Benchmark Dataset and Feature Fusion Network for Document Image Quality Assessment
arxiv.org·5h
🤖Advanced OCR
Improving Zero-shot Sentence Decontextualisation with Content Selection and Planning
arxiv.org·5h
📄Text Chunking
OnePiece: Bringing Context Engineering and Reasoning to Industrial Cascade Ranking System
arxiv.org·5h
🎯Content Recommendation
CommonForms: A Large, Diverse Dataset for Form Field Detection
arxiv.org·5h
📄PDF Archaeology
The low-cost path to AI Mastery
antonyarkov.substack.com·1d·
Discuss: Substack
⚡Proof Automation
SAM-DCE: Addressing Token Uniformity and Semantic Over-Smoothing in Medical Segmentation
arxiv.org·5h
📊Learned Metrics