💾 Cache Algorithms - abnv · Scour

UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory

arxiv.org·23h

🧠Memory Allocators

lru_cache vs singleton in Python — they're not the same thing.

dev.to·5h·

Discuss: DEV

🔗Weak References

Intel Posts 2026 Update For Cache Aware Scheduling On Linux

phoronix.com·7h

🏗️CPU Architecture

Minimum Energy Per Query

semiengineering.com·20h

⏲️Embedded GC

BlaiseLM/gocache: A thread-safe, network-accessible LRU cache server written in Go.

github.com·1d·

Discuss: r/golang

DeltaKV: Residual-Based KV Cache Compression via Long-Range Similarity

arxiv.org·2d

🧠Memory Hierarchy

How caching helps in LLM Application?

dev.to·9h·

Discuss: DEV

🧠Memory Models

A RISC-V vector extension primer

blog.adafruit.com·13h

harishsg993010/tiny-NPU: opensource NPU for LLM inference (this run gpt2)

github.com·9h·

Discuss: r/LocalLLaMA

🗺️Region Inference

Zero State Architecture deep dive

news.ycombinator.com·11h·

Discuss: Hacker News

📡Erlang BEAM

Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy

venturebeat.com·6h

🗺️Region Inference

Supercharging Inference for AI Factories: KV Cache Offload as a Memory-Hierarchy Problem

blog.min.io·13h

⚡Cache-Aware Algorithms

Discussion - Investigation of Single Thread CPU "Thoughput/cycle"

forums.anandtech.com·1d

ianbarber.blog·1d·

Discuss: Hacker News

⚡Partial Evaluation

Avoiding UB but "safe" data race in a lock-free slab allocator - help - The Rust Programming Language Forum

users.rust-lang.org·1d

🔒Rust Borrowing

Intel Nova Lake Compute Tile Die Sizes Leak Highlighting Massive L3 Cache Expansion

hothardware.com·14h

⚡Instruction Fusion

[Development] 4MB 32-bit SRAM for the MicroMac Performer

68kmla.org·5h

🏷️Memory Tagging

Cache-aware disaggregated inference for up to 40% faster long-context LLM serving

together.ai·2d·

Discuss: Hacker News, r/LocalLLaMA

⏲️Embedded GC

AI in Multiple GPUs: Understanding the Host and Device Paradigm

towardsdatascience.com·15h

🤝Cooperative Threading

Why agile development is hard in hardware

evercurrent.substack.com·10h·

Discuss: Substack

🔗Language Toolchains
