🏗️ NUMA - hello · Scour

DeltaKV: Residual-Based KV Cache Compression via Long-Range Similarity

arxiv.org·12h

🌊Memory Bandwidth

Performance Tip of the Week #62: Identifying and reducing memory bandwidth needs

abseil.io·2d

🚀Performance

AFMTJ Model For In-Memory Computing (University of Arizona)

semiengineering.com·59m

DualMap: Enabling Both Cache Affinity and Load Balancing for Distributed LLM Serving

arxiv.org·1d

🧩Cache Partitioning

LocalGPT: A local AI assistant with persistent memory in a single binary

localgpt.app·22h·Discuss: Hacker News

🔗Intrusive Containers

Parallel Track Transformers: Enabling Fast GPU Inference with Reduced Synchronization

machinelearning.apple.com·17h

SectorC: a C compiler in 512 bytes

blog.adafruit.com·19h

CPUs are Back: The Datacenter CPU Landscape in 2026

newsletter.semianalysis.com·22h·Discuss: r/hardware

🏗️CPU Cache Topology

Hitting 1,000 tokens per second on a single RTX 5090

blog.alpindale.net·1d·Discuss: Hacker News

📍CPU Pinning

Great Power, Great Latency: The Spider-Sense of NUMA Tuning

mydbanotebook.org·5d

🔄Hardware Transactional Memory

docs.modular.com·19h

NUASM — Neuro‑Universal‑ASM: The World's First Native Multi‑Language Assembler

dev.to·2d·Discuss: DEV

The middle ground between canonical models and data mesh

frederickvanbrabant.com·3h·Discuss: r/programming

🏗️Data Modeling

Market Winners and Losers of the Memory Chip Squeeze

bloomberg.com·1h

Lucene HNSW performance: A deep dive into the OS page cache

opensearch.org·21h

The Meta Lattice update is driving real performance lift across Meta

brainlabsdigital.com·2h

🚀Performance

How Anam Achieved 250% Faster Inference Using Zymtrace Continuous GPU Profiling

zymtrace.com·1d

🎮SIMT Execution

How PCIe, NVLink, and NUMA Topology Affect GPU Scheduling Outcomes

dev.to·1d·Discuss: DEV

Faster AI Training Unlocked With New System For Massive Language Models

quantumzeitgeist.com·1d

AMD Ryzen MAX 500 “Medusa Halo” rumored to support LPDDR6 memory

videocardz.com·3h
