LLMs on a Shoestring: The Dynamic Cache Advantage by Arvind Sundararajan
dev.to·20h·
Discuss: DEV
💾Cache Algorithms
Optimizing Code Cache Performance for Large Code Footprint Java Applications on Neoverse
community.arm.com·12m
Cache-Aware Algorithms
Conquering the LLM Memory Wall: How to Run 2–4x Longer Contexts with a Single Line of Code
reddit.com·18h·
Discuss: r/LocalLLaMA
🗺️Region Inference
Semantic Dictionary Encoding
falvotech.com·15h·
Discuss: Hacker News
🗂️Type Indexing
GB/s Level Editable DOM JSON Engine: The Architectural Philosophy Behind LJSON
github.com·1h·
Discuss: DEV
📋JSON Parsing
Building High-Performance Caching in Go: A Practical Guide
dev.to·1d·
Discuss: DEV
🧠Memory Models
Building a Simple Stack-Based Virtual Machine in Go
blog.phakorn.com·23h·
📚Stack Data Structures
More hardware won’t fix bad engineering
infoworld.com·21h
🔮Branch Predictors
What Facebook's Memcache Taught Me About Systems Thinking
lorbic.com·13h·
Discuss: Hacker News
Cache-Aware Algorithms
AQUA: Attention via QUery mAgnitudes for Memory and Compute Efficient Inference in LLMs
arxiv.org·2h
🔍ML Language
Identifying Divergences in HW Designs For High Performance Computing Workloads (LBNL et al.)
semiengineering.com·13h
Performance
Boost Windows 11 Performance: Clear Cache for Speed and Space
webpronews.com·16h
🧠Memory Consistency
The future of microoptimization
goldenstack.net·3d·
Discuss: Hacker News
🔬Nanopasses
Is Recursion in LLMs a Path to Efficiency and Quality?
pub.towardsai.net·6h
🪜Recursive Descent
H100 PCIe – 1.86 TB/s memcpy roofline and 8× uplift
news.ycombinator.com·2d·
Discuss: Hacker News
🧠Memory Hierarchy
StringWa.rs on GPUs: Databases & Bioinformatics 🦠
ashvardanian.com·11h·
🚀Tokenizer Performance
Disaggregated Inference at Scale with PyTorch and VLLM
pytorch.org·2d·
Discuss: Hacker News
🔄Subinterpreters
Why some agentic AI developers are moving code from Python to Rust
developers.redhat.com·23h
Interpreter Optimization
Topaz_Gigapixel_AI_8.4.3.dmg
xmac.app·14h
📦Executable Size
Revel: My Experiment in Infinite, Portable Note-Taking with C and GTK4
velostudio.github.io·9h·
💬Smalltalk VMs