DiffuSpec: Unlocking Diffusion Language Models for Speculative Decoding
arxiv.orgΒ·17h
⚑Tokenizer Optimization
πŸŽ™οΈ Building an AI-Powered Interview Analyzer on GCP
dev.toΒ·5hΒ·
Discuss: DEV
πŸ”„Incremental Parsers
LLM Optimization Notes: Memory, Compute and Inference Techniques
gaurigupta19.github.ioΒ·5hΒ·
Discuss: Hacker News
πŸ—ΊοΈRegion Inference
TrueType rasterizer
github.comΒ·2hΒ·
✨Code Formatters
Eclectic English Vocab
404wolf.comΒ·20h
πŸ”„Incremental Lexing
Claude Sonnet vs GLM 4.6: A Token Efficiency Comparison
reddit.comΒ·1dΒ·
Discuss: r/ClaudeAI
⚑Tokenizer Benchmarks
Fun with HyperLogLog and SIMD
vaktibabat.github.ioΒ·2dΒ·
πŸ”’Bit Manipulation
Introducing OpenZL: An Open Source Format-Aware Compression Framework
engineering.fb.comΒ·5hΒ·
πŸ“¦Compression Algorithms
Technical Explanations Why LLMs Use Em Dashes
msukhareva.substack.comΒ·1dΒ·
Discuss: Substack
πŸ”„Incremental Tokenizers
Writing a Dictation Application
osada.blogΒ·1d
πŸ“šSelf-Documenting Code
Β΅s Human-Readable IDs: A Performance Journey
dev.toΒ·8hΒ·
Discuss: DEV
πŸ“‹JSON Parsing
What happened to Longcat models? Why are there no quants available?
huggingface.coΒ·3hΒ·
Discuss: r/LocalLLaMA
✨Gleam
Language Support for Marginalia Search
marginalia.nuΒ·21h
πŸ”Text Indexing
Python PEP 636 – Structural Pattern Matching: Tutorial
peps.python.orgΒ·1dΒ·
Discuss: Hacker News
πŸ’¬Interactive REPLs
Solving Reproducibility Challenges in Deep Learning and LLMs: Our Journey
ingonyama.comΒ·2dΒ·
Discuss: Hacker News
πŸ—ΊοΈRegion Inference
How OpenAI Uses Kubernetes And Apache Kafka for GenAI
blog.bytebytego.comΒ·6h
πŸ“‘Erlang BEAM
What Happens Behind the Scenes When You Run Python Code
fusion-institute.comΒ·12hΒ·
Discuss: DEV
πŸ”§Error Recovery
Day 24 of My 90 Days Python Series – Word Counter Tool
github.comΒ·10hΒ·
Discuss: DEV
πŸ’¬Interactive REPLs
An alternative to knowledge graphs for storing loosely structured content
fleetingswallow.comΒ·1dΒ·
Discuss: Hacker News
🌲Tree Rewriting