๐Ÿฟ๏ธ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
๐Ÿ“ Text Parsing

Token Processing, Grammar Rules, AST Generation, Language Recognition

BNFGen: A random text generator based on context-free grammars
baturin.orgยท26mยท
Discuss: Hacker News
๐ŸŒณContext free grammars
Contextualizing SUTRA: Advancements in Multilingual & Efficient LLMs
hackernoon.comยท2h
๐Ÿ’ปLocal LLMs
Cactus Language โ€ข Syntax 12
inquiryintoinquiry.comยท2h
๐Ÿ“Concrete Syntax
Stop Words Using Spacy - NLP
dev.toยท1dยท
Discuss: DEV
๐Ÿ“ผTape Linguistics
The Bitter Lesson is coming for Tokenization
lucalp.devยท1dยท
Discuss: Lobsters, Hacker News, r/programming
๐Ÿ”—Monadic Parsing
davidchisnall/igk: I got Knuth'd: A compiler for documents
github.comยท11h
๐Ÿ“Concrete Syntax
Multimodal Political Bias Identification and Neutralization
arxiv.orgยท1d
๐Ÿค–Advanced OCR
LR(1) parse-tables generator
github.comยท1dยท
Discuss: Lobsters, Hacker News
๐Ÿ”Z3 Parsing
June 25, 2025 Flight Tracking Workshop (4 hour) [Americas / Europe-friendly time]
bellingcat.comยท18h
๐ŸงฎProlog Parsing
Multilingual Tokenization through the Lens of Indian Languages: Challenges and Insights
arxiv.orgยท1d
๐Ÿค–Automated Parsing
Kumo Surfaces Structured Data Patterns Generative AI Misses
thenewstack.ioยท4h
๐Ÿ“ŠGraph Databases
LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning
arxiv.orgยท1d
๐Ÿ“Concrete Syntax
Semantic-Aware Parsing for Security Logs
arxiv.orgยท1d
๐Ÿ“Log Parsing
Practical tips to optimize documentation for LLMs, AI agents, and chatbots
biel.aiยท23hยท
Discuss: Hacker News
๐Ÿค–Archive Automation
Driving cost-efficiency and speed in claims data processing with Amazon Nova Micro and Amazon Nova Lite
aws.amazon.comยท1h
๐ŸŒŠStream Processing
Why Your Next LLM Might Not Have A Tokenizer
towardsdatascience.comยท22h
๐Ÿค–Grammar Induction
Deep Dive into Databend UDF, implementing your data solutions with Python, WASM
databend.comยท3hยท
Discuss: Hacker News
๐Ÿ“‹DFDL
Text2Struct: A Machine Learning Pipeline for Mining Structured Data from Text
arxiv.orgยท1d
๐Ÿ”คCharacter Classification
Build a Sentence-Level Text-to-Speech Reader in JavaScript
jsdev.spaceยท2dยท
Discuss: Hacker News
๐ŸŒณIncremental Parsing
End-to-End Spoken Grammatical Error Correction
arxiv.orgยท1d
๐Ÿ”Z3 Parsing
Loading...Loading more...
AboutBlogChangelogRoadmap