💾 Cache Algorithms - abnv · Scour

DualMap: Enabling Both Cache Affinity and Load Balancing for Distributed LLM Serving

arxiv.org·1d

🧠Memory Consistency

Performance Tip of the Week #83: Reducing memory indirections

abseil.io·2d

📦Compact Data

Concurrency Deep Dive: Memory Models, Lock-Free, and RCU

dev.to·2d·

Discuss: DEV

🧠Memory Models

KV-CoRE: Benchmarking Data-Dependent Low-Rank Compressibility of KV-Caches in LLMs

arxiv.org·4d

⚡Cache-Aware Algorithms

Caching in 2026: Fundamentals, Invalidation, and Why It Matters More Than Ever

lukasniessen.medium.com·1d·

Discuss: r/node

⚡Cache Optimization

Faster AI Training Unlocked With New System For Massive Language Models

quantumzeitgeist.com·17h

⚡Tokenizer Optimization

Hitting 1,000 tokens per second on a single RTX 5090

blog.alpindale.net·1d·

Discuss: Hacker News, Hacker News

🎯Ring Buffers

SAE Feature Matchmaking (Layer-to-Layer) by Mitali M

greaterwrong.com·3h

🔢Algebraic Datatypes

Testing a 6200 and comparison with 6100

68kmla.org·3h

⏱️Real-Time GC

A Note on Flat Abstract Syntax Trees

gist.github.com·13h·

Discuss: Hacker News

🌳Tree Walking

Performance Tip of the Week #62: Identifying and reducing memory bandwidth needs

abseil.io·2d

⚡Cache-Aware Algorithms

Main Content || Math ∩ Programming

jeremykun.com·1d

🔢Algebraic Datatypes

Linker Script Generation for Firmware Projects: A Primer

dnedic.github.io·12h·

Discuss: Hacker News, r/embedded

🔗Language Toolchains

tzcnt/TooManyCooks: C++20 concurrency framework with no compromises. Excellent performance, powerful features, and simple syntax.

github.com·21h

Heterogeneous Processing: A Strategy for Augmenting Moore's Law (2006)

linuxjournal.com·1d·

Discuss: Hacker News

🔀SIMD Programming

Intel Core Ultra "Arrow Lake Refresh" Chips Focus on E-core Count and L3 Cache Uplifts

techpowerup.com·14h

⚡Instruction Fusion

Beat the RAM shortage: How to get 32GB of Corsair DDR5 for cheap before prices climb even higher

techradar.com

·16h

📏Linear Memory

Clearing caches

artima.com·2d

🔗Weak References

An introduction to lockless algorithms [LWN.net]

lwn.net·20h

🎯Ring Buffers

LocalGPT: A local AI assistant with persistent memory in a single binary

localgpt.app·12h·

Discuss: Hacker News

💬Smalltalk VMs

Loading more...