📝 Text Parsing - matmat · Scour

📝 Text Parsing

Token Processing, Grammar Rules, AST Generation, Language Recognition

Mastering NLP with spaCy – Part 2

towardsdatascience.com·3d

🌲Parse Trees

You know more Finnish than you think

dannybate.com·5h·

Discuss: Hacker News

🇸🇪Nordic Algorithms

LLGuidance: Making Structured Outputs Go Brrr

guidance-ai.github.io·4d·

Discuss: Hacker News

📝Concrete Syntax

Building an AI Tokenization Demo: From Workshop to App

dev.to·12h·

Discuss: DEV

🌀Brotli Internals

DACTYL: Diverse Adversarial Corpus of Texts Yielded from Large Language Models

arxiv.org·20h

🏛Digital humanities

Cactus Language • Pragmatics 9

inquiryintoinquiry.com·10h

📝Concrete Syntax

Extensions and Shadows (9)

sites.psu.edu·10h

✨Effect Handlers

SAT Requires Exhaustive Search

link.springer.com·3h·

Discuss: Hacker News

🧮Kolmogorov Complexity

LLMs - Embeddings 01

dev.to·18h·

Discuss: DEV

🧮Vector Embeddings

Briefly explained: What's behind the buzzword AI agents

heise.de·13h

Abhinav Sarkar: A Bytecode VM for Arithmetic: The Parser

abhinavsarkar.net·3d

🔗Functional Compilers

Using Dspy to Detect Document Boundaries

kmad.ai·1d·

Discuss: Hacker News

📄Document Digitization

Using LLM Embeddings to Normalize User Data

matthodges.com·2d·

Discuss: Hacker News

🔤Character Classification

The Revolution of Token-Level Rewards

levroai.com·9h·

Discuss: Hacker News

⚡Incremental Computation

I built a collection of simple Python projects for beginners (CLI,GUI,Web,API)

github.com·2h·

Discuss: Hacker News

🌀Brotli Internals

Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey

arxiv.org·3d

One-fifth of computer science papers may include AI content

science.org·14h·

Discuss: Hacker News

🤖AI Curation

Do LLMs produce texts with "human-like" lexical diversity?

arxiv.org·20h

🤖Grammar Induction

Machine Learning Fundamentals: machine learning

dev.to·1d·

Discuss: DEV

🧠Machine Learning

TokenSpan: Rethinking Prompt Compression with Aliases and Dictionary Encoding

dev.to·2h·

Discuss: DEV

📝Concrete Syntax

Loading more...