Beating the L1 cache with value speculation (2021)
mazzo.li·7h·
Discuss: Lobsters
🔮Branch Predictors
CPU Cache-Friendly Data Structures in Go: 10x Speed with Same Algorithm
skoredin.pro·16h·
Discuss: Hacker News
Cache Optimization
LLM Optimization Notes: Memory, Compute and Inference Techniques
gaurigupta19.github.io·7h·
Discuss: Hacker News
🗺️Region Inference
A Primer on Memory Consistency and Cache Coherence, Second Edition
link.springer.com·1d·
Discuss: r/programming
🧠Memory Models
The Next Computing Revolution: Bringing Processing Inside Memory
computer.org·2h·
Discuss: Hacker News
🧠Memory Models
How to run LLMs on a 1GB (e-waste) GPU without changing a single line of code
reddit.com·11h·
Discuss: r/LocalLLaMA
Cache Optimization
Highly concurrent in-memory counter in GoLang
engineering.grab.com·23h
🧠Memory Models
Algorithms For Black-Box, Physical-to-DRAM Address-Mapping Recovery (Georgia Tech, CNRS, Et Al.)
semiengineering.com·2h
🏷️Memory Tagging
Caching in Vector Database: What You Need to Know
dev.to·14h·
Discuss: DEV
Cache Optimization
Achieving 1.2 TB/s Aggregate Bandwidth by Optimizing Distributed Cache Network
juicefs.com·1d·
Discuss: Hacker News
🌍HTTP Servers
Reflections on Designing a Search Autocomplete System
torontostudygroup.github.io·1d·
Discuss: DEV
🌿Trie Structures
Souvenir
deprogrammaticaipsum.com·20h
🔗Weak References
I built a faster singleflight implementation for Go (zero allocations, ~4x faster than std)
reddit.com·8h·
Discuss: r/golang
💾Zero-Copy
Rigorous Evaluation of Microarchitectural Side-Channels with Statistical Model Checking
arxiv.org·19h
📱Bytecode Design
Speeding Up Data Decompression with nvCOMP and the NVIDIA Blackwell Decompression Engine
developer.nvidia.com·23h
💾Zero-Copy
Beyond the Single-Writer Limitation with Turso's Concurrent Writes
turso.tech·23h·
Discuss: Hacker News
🧠Memory Models
[NodeBook] Memory Fragmentation and Buffer Coding Challenges
thenodebook.com·3d·
Discuss: r/node
🏟️Arena Allocation
Solving Reproducibility Challenges in Deep Learning and LLMs: Our Journey
ingonyama.com·2d·
Discuss: Hacker News
🗺️Region Inference
Thundering Herd Problem (Cache Stampede): Solutions & Prevention | System Design Tutorial
howtech.substack.com·1d·
Discuss: r/programming
🔗Weak References
A case for learning GPU programming with a compute-first mindset – Maister's Gr
themaister.net·11h·
🔗Language Toolchains