๐Ÿฟ๏ธ ScourBrowse
📃 Manuscript Tokenization

Medieval Text Processing, Paleographic Parsing, Historical NLP, Character Segmentation

Unfolding the Past: A Comprehensive Deep Learning Approach to Analyzing Incunabula Pages
arxiv.org·1d
🤖Manuscript AI
davidchisnall/igk: I got Knuth'd: A compiler for documents
github.com·16h
Concrete Syntax
I Figured Out What the Voynich Manuscript Says; It's Something More Than Words
dmerullo.substack.com·7h
Discuss: Substack
Manuscript Networks
Why Your Next LLM Might Not Have A Tokenizer
towardsdatascience.com·1d
🤖Grammar Induction
Building an ML model to generate fonts
fontweaver.com·1d
Discuss: Hacker News
🔠Terminal Fonts
How to Prove That An Email Was Received
metaspike.com·2h
📄Document Digitization
The modern text processing pipeline: Overview
newroadoldway.com·2d
Discuss: Lobsters, r/programming
🔤Unicode Normalization
Unveiling Factors for Enhanced POS Tagging: A Study of Low-Resource Medieval Romance Languages
arxiv.org·1d
👁️Medieval OCR
BNFGen: A random text generator based on context-free grammars
baturin.org·5h
Discuss: Hacker News
🌳Context free grammars
Explaining software and computational methods
blog.khinsen.net·22h
Discuss: Hacker News
Concrete Syntax
A Standard for Human-Centered Investigation Playbooks
chrissanders.org·3h
🎯Threat Hunting
Kumo Surfaces Structured Data Patterns Generative AI Misses
thenewstack.io·8h
📊Graph Databases
Contextualizing SUTRA: Advancements in Multilingual & Efficient LLMs
hackernoon.com·6h
💻Local LLMs
Using Wavelets and Clustering to Predict Odd or Even Numbers: An Overengineered Approach with Pretty (But Confusing) Plots
dev.to·8h
Discuss: DEV
🧠Machine Learning
Show HN: Towards agentic Graph RAG: Enhancing graph retrieval with vector search
blog.kuzudb.com·1h
Discuss: Hacker News
📊Graph Databases
Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated
arxiv.org·18h
🧮Kolmogorov Complexity
June 25, 2025 Flight Tracking Workshop (4 hour) [Americas / Europe-friendly time]
bellingcat.com·22h
🧮Prolog Parsing
Capturing my handwriting in a searchable digital format – the long way round
colinramsay.co.uk·1d
Discuss: Hacker News
📲Digitization
Text2Struct: A Machine Learning Pipeline for Mining Structured Data from Text
arxiv.org·1d
🔤Character Classification
Driving cost-efficiency and speed in claims data processing with Amazon Nova Micro and Amazon Nova Lite
aws.amazon.com·5h
🌊Stream Processing