Cutting LLM Batch Inference Time in Half: Dynamic Prefix Bucketing at Scale
🔁Cache Coherence
Flag this post
How to debug a 200ms+ ‘System (self)’ task with no visible subtasks in Chrome Performance trace?
📋Zero-Copy
Flag this post
Does Go's garbage collector use Depth-First Search (DFS) or Breadth-First Search (BFS) during the scan/marking phase?
🗄️Database Internals
Flag this post
A Friendly Tour of Process Memory on Linux
📋Zero-Copy
Flag this post
Low-Level Hacks
⚙️Systems Programming
Flag this post
Inside Pinecone: Slab Architecture
🗄️Database Internals
Flag this post
How to Design Efficient Memory Architectures for Agentic AI Systems
pub.towardsai.net·9h
🔁Cache Coherence
Flag this post
Rodrigo Girão Serrão: A generator, duck typing, and a branchless conditional walk into a bar
mathspp.com·7h
🔢algo
Flag this post
Building blobd: single-machine object store with sub-millisecond reads and 15 GB/s uploads
🗄️Database Internals
Flag this post
How to build a Heapless Vector using `MaybeUninit<T>` for Better Performance.
🦀Rust Macros
Flag this post
flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R
arxiv.org·23h
⚡Performance Engineering
Flag this post
Loading...Loading more...