ClapperText: A Benchmark for Text Recognition in Low-Resource Archival Documents
arxiv.orgยท1d
๐Ÿ“„OCR
Flag this post
Optimizing LLM Context for Vulnerability Scanning
blog.fraim.devยท15hยท
Discuss: Hacker News
๐ŸงชBinary Fuzzing
Flag this post
Word Processing: Heavy Metal Style
hackaday.comยท14h
๐Ÿ—ƒ๏ธPunched Cards
Flag this post
Automating Word Document Creation with Python: A Practical Guide
dev.toยท1dยท
Discuss: DEV
๐Ÿ“‹Document Layout
Flag this post
How to Use Frontier Vision LLMs: Qwen3-VL
towardsdatascience.comยท10h
๐Ÿ‘๏ธConstructive OCR
Flag this post
Deepseek's OCR system compresses image-based text so AI can handle much longer documents
the-decoder.comยท17h
๐Ÿค–Advanced OCR
Flag this post
AI-Generated Text Detection in Low-Resource Languages: A Case Study on Urdu
arxiv.orgยท3h
๐Ÿ”คCharacter Classification
Flag this post
From multilingual semantic search to virtual assistants at Bosch Digital
stackoverflow.blogยท17h
๐Ÿ“šMARC Evolution
Flag this post
Gemini and I Wrote a Book: Introduction to Computational Linguistics
dubovik.euยท1dยท
Discuss: Hacker News
๐Ÿ“Concrete Syntax
Flag this post
Should LLMs just treat text content as an image?
seangoedecke.comยท7hยท
Discuss: Hacker News
๐Ÿค–Advanced OCR
Flag this post
Writing a tiny TrueType parser and renderer from scratch
yayo1.comยท2dยท
Discuss: Hacker News
๐Ÿ”คFont Archaeology
Flag this post
Unsupervised Learning NO. 503
newsletter.danielmiessler.comยท12h
๐Ÿ•ต๏ธVector Smuggling
Flag this post
A Token of My Affliction: The Hidden Pain Behind Every LLM
dev.toยท2dยท
Discuss: DEV
๐Ÿ“Text Parsing
Flag this post
Publication Trend Analysis and Synthesis via Large Language Model: A Case Study of Engineering in PNAS
arxiv.orgยท3h
๐Ÿ“ŠCitation Graphs
Flag this post
Show HN: MarkdownConverters โ€“ Convert any file format to clean Markdown
markdownconverters.comยท1dยท
Discuss: Hacker News
๐Ÿ”„Migration Tools
Flag this post
DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning
arxiv.orgยท3h
๐ŸงฎVector Embeddings
Flag this post
DocMind, Streamlit Application Leveraging LlamaIndex, LangGraph, and LLM
github.comยท12hยท
Discuss: Hacker News
๐Ÿค–Archive Automation
Flag this post
When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling
arxiv.orgยท1d
๐Ÿ”จCompilers
Flag this post
Intelligent Communication Mixture-of-Experts Boosted-Medical Image Segmentation Foundation Model
arxiv.orgยท3h
๐Ÿง Machine Learning
Flag this post