CPU Cache-Friendly Data Structures in Go: 10x Speed with Same Algorithm
skoredin.pro·2d·
Discuss: Hacker News
⚡Cache Optimization
Memory fragmentation? leak? in Rust/Axum backend
reddit.com·3h·
Discuss: r/rust
🔒Rust Borrowing
Memory leaks: the forgotten side of web performance (2022)
nolanlawson.com·20h·
Discuss: Hacker News
🔗Weak References
Want to see how your VPS stacks up under real load?
webdock.io·5h·
Discuss: DEV
⚡Performance
H1B-KV: Hybrid One-Bit Caches for Memory-Efficient Large Language Model Inference
arxiv.org·9h
🗺️Region Inference
Latency vs. Accuracy for LLM Apps: How to Choose and How a Memory Layer Lets You Win Both
dev.to·1d·
Discuss: DEV
🎮Language Ergonomics
Highly concurrent in-memory counter in GoLang
engineering.grab.com·2d
🧠Memory Models
Walrus, A 1M ops/sec, 1 GB/s Write Ahead Log in Rust
nubskr.com·22h·
💾Zero-Copy
The Art of Abstraction: Polymorphic Memory Allocator
unboxthecat.medium.com·1d·
Discuss: r/cpp
🏗️Custom Allocators
GPU Instanced Grass Breakdown
cyanilux.com·1d·
Discuss: Hacker News
🏗️Custom Allocators
We Bet on Rust to Supercharge Feature Store at Agoda
medium.com·2h·
🚂Cranelift Backend
We built a CUDA emulator that profiles GPU code with zero hardware
rightnowai.co·1d·
Discuss: Hacker News
🏗️Custom Allocators
The Next Computing Revolution: Bringing Processing Inside Memory
computer.org·1d·
Discuss: Hacker News
🧠Memory Models
LLM Optimization Notes: Memory, Compute and Inference Techniques
gaurigupta19.github.io·1d·
Discuss: Hacker News
🗺️Region Inference
Souvenir
deprogrammaticaipsum.com·2d
🔗Weak References
Over the Fence.... And Far Away....
megalomaniacbore.blogspot.com·14h
🎯Ring Buffers
10 Command-Line Tools Every Data Scientist Should Know
kdnuggets.com·1h
🐚Shell Languages
What happened to Longcat models? Why are there no quants available?
huggingface.co·1d·
Discuss: r/LocalLLaMA
✨Gleam