🚀 Tokenizer Performance - abnv · Scour

Detecting Overflow in Compressed Token Representations for Retrieval-Augmented Generation

arxiv.org·10h

🌊Streaming Lexers

Random Access in Grammar-Compressed Strings: Optimal Trade-Offs in Almost All Parameter Regimes

arxiv.org·1d

📐Succinct Data Structures

Show HN: OCR Arena – A playground for OCR models

news.ycombinator.com·1d·

Discuss: Hacker News

⚡Tokenizer Optimization

High error rate in text-embedding-3-small

status.openai.com·1d

🧪Parser Testing

Large Language Models for Mortals book

andrewpwheeler.com·2d

DeepSeek-V3.2 on GB300: Performance Breakthrough

blog.vllm.ai·15h

🗺️Region Inference

Built a Hybrid RAG API with FastAPI & Ollama – Sparse + Dense retrieval in action.

youtu.be·1d·

Discuss: DEV

🌳Pattern Match Compilation

Show HN: WavNav, a desktop app to explore and search large sample libraries

maxgraf.space·2h·

Discuss: Hacker News

🌳Parser Visualization

Show HN: Decoder – Static call graph analysis for Python

github.com·5h·

Discuss: Hacker News

📊Call Graph Analysis

🔑Beginner-Friendly Guide 'Longest Balanced Substring II' - Problem 3714 (C++, Python, JavaScript)

dev.to·6h·

Discuss: DEV

🔤String Algorithms

Programming languages

mothcodes.bearblog.dev·5h

🔬Programming Language Theory

wordchipper - my next-gen LLM tokenizer; looking for LTR release help

docs.rs·3d·

Discuss: r/rust

🔤Language Tokenizers

A History of Large Language Models

gregorygundersen.com·1d

🪜Recursive Descent

Coding A PoS Tagger from Scratch — A Statistical Part-of-Speech Tagger | NLP

pub.towardsai.net·3d

🔤Language Tokenizers

How Andrej Karpathy Built a Working Transformer in 243 Lines of Code

analyticsvidhya.com·1d

🪜Recursive Descent

Unleash your ideas with ASCII

monosketch.io·3h·

Discuss: Hacker News

💬Smalltalk VMs

Index Compression, Query Execution Improvements

marginalia.nu·15h

📊Query Optimizers

Scaling LLM Post-Training at Netflix

netflixtechblog.com·7h

🗺️Region Inference

Zvec: SQLite-like simplicity in an embedded vector database (By Alibaba)

zvec.org·1d·

Discuss: Hacker News

💾Minimal Databases

[AINews] Z.ai GLM-5: New SOTA Open Weights LLM

latent.space·1d

🏁Language Benchmarks
