A Primer on Memory Consistency and Cache Coherence, Second Edition
link.springer.com·2d·
Discuss: r/programming
Cache Theory
Beating the L1 cache with value speculation (2021)
mazzo.li·1d·
CPU Microarchitecture
H1B-KV: Hybrid One-Bit Caches for Memory-Efficient Large Language Model Inference
arxiv.org·2h
💨Cache Optimization
The Next Computing Revolution: Bringing Processing Inside Memory
computer.org·1d·
Discuss: Hacker News
Hardware Transactional Memory
Latency vs. Accuracy for LLM Apps — How to Choose and How a Memory Layer Lets You Win Both
dev.to·19h·
Discuss: DEV
Performance Mythology
LLM Optimization Notes: Memory, Compute and Inference Techniques
gaurigupta19.github.io·1d·
Discuss: Hacker News
💻Local LLMs
Fast Matrix Multiply on an Apple GPU
percisely.xyz·9h·
Discuss: Hacker News
SIMD Vectorization
Walrus, A 1M ops/sec, 1 GB/s Write Ahead Log in Rust
nubskr.com·15h·
💿ZFS Internals
CPU Cache-Friendly Data Structures in Go: 10x Speed with Same Algorithm
skoredin.pro·1d·
Discuss: Hacker News
💨Cache Optimization
Tiga: Accelerating Geo-Distributed Transactions with Synchronized Clocks
muratbuffalo.blogspot.com·2h·
🤝Distributed Consensus
Static Bundle Object: Modernizing Static Linking
medium.com·15h·
Discuss: Hacker News
🔗Static Linking
Achieving 1.2 TB/s Aggregate Bandwidth by Optimizing Distributed Cache Network
juicefs.com·2d·
Discuss: Hacker News
📡Network Stack
I3-12100 Jellyfin/torrent NAS build review
reddit.com·13h·
Discuss: r/homelab
🗄️SQLite Internals
Beyond the Single-Writer Limitation with Turso's Concurrent Writes
turso.tech·2d·
📝SQLite WAL
Hardware Stockholm Syndrome
programmingsimplicity.substack.com·1d·
Discuss: Substack
🔩Systems Programming
We built a CUDA emulator that profiles GPU code with zero hardware
rightnowai.co·1d·
Discuss: Hacker News
🎯Emulator Accuracy
State-Compute Replication: Parallelizing High-Speed Stateful Packet Processing
danglingpointers.substack.com·15h·
Discuss: Substack
📡Network Stack
Design for Chaos: Fastly’s Principles of Fault Isolation and Graceful Degradation
fastly.com·1d
🛡️Error Boundaries
Show HN: Sovant – Memory that works across OpenAI, Claude and Gemini
sovant.ai·14h·
Discuss: Hacker News
Hardware Transactional Memory