Understanding the KV Cache (feat. Self-Attention)
dev.to·8h·
Discuss: DEV
🔄Subinterpreters
Blogpost: A Mental Model for GPU Engineering for LLMs
modelcraft.substack.com·2d·
Discuss: Substack
🧠Memory Models
Why We Created Turso, a Rust-Based Rewrite of SQLite
thenewstack.io·9m
💾Minimal Databases
The Best Performance Optimization Is Sometimes Changing Your Architecture
reddit.com·1d·
Discuss: r/webdev
🚀Code Motion
State of the Art of AI Tools in Micro-Frontend Architectures • Luca Mezzalira • GOTO 2025
youtube.com·3h
💬Smalltalk VMs
Index-mcp native Rust
github.com·2h·
Discuss: r/rust
🚂Cranelift Backend
Simple LLM VRAM calculator for model inference
bestgpusforai.com·2d·
Discuss: Hacker News
🗺️Region Inference
Solving Reproducibility Challenges in Deep Learning and LLMs: Our Journey
ingonyama.com·2d·
Discuss: Hacker News
🗺️Region Inference
The Role of AI in Next-Gen Chip Design
dev.to·15h·
Discuss: DEV
🔌Microcontrollers
WASM in the Kernel: Tales of Triumph and Trouble
riptides.io·1h·
Discuss: Hacker News
🌐WASM Runtimes
µs Human-Readable IDs: A Performance Journey
dev.to·2h·
Discuss: DEV
📋JSON Parsing
Is Odin Just a More Boring C?
dayvster.com·4h·
Discuss: Hacker News
🐹Go Internals
Gabriele Bartolini: CNPG Recipe 22 - Leveraging the New Supply Chain and Image Catalogs
gabrielebartolini.it·4h
🗑️Stack Scanning GC
A Global Mining Dataset
tech.marksblogg.com·4h·
Discuss: Hacker News
📈Earley Parsing
Predictive Coding Light
nature.com·15h
🗺️Region Inference
Medium Android App — Migrating from Apollo Kotlin 3 to 4: Lessons Learned
medium.engineering·6h
🔗Weak References
Why We Need SIMD
parallelprogrammer.substack.com·11h·
Discuss: Substack
🔀SIMD Programming
Bulk operations in Boost.Bloom
bannalia.blogspot.com·1d·
🌸Bloom Indexing
Design Principle: Composable Services
sleepingpotato.com·1h·
Discuss: Hacker News
🔀Control Structures