⚡ Cache Optimization - hello · Scour

SD #010 - Designing a Distributed Cache System

dev.to·1h·

Discuss: DEV

💾Cache Design

Memory Caching: RNNs with Growing Memory

arxiv.org·1d

Practical strategies for vLLM performance tuning

developers.redhat.com·2h

⚙️Performance Profiling

Optimizing Recommendation Systems with JDK’s Vector API

netflixtechblog.com·7h·

Discuss: Hacker News, r/programming

⚡SIMD Optimization

Why AI requires rethinking the storage-compute divide

infoworld.com·31m

📋Columnar Storage

Time is of the essence: EBR in High-Performance Databases

dev.to·1d·

Discuss: DEV

♻️Epoch-Based Reclamation

Show HN: Benchmarking the Keep memory system with LoCoMo

keepnotes.ai·15h·

Discuss: Hacker News

🧠Memory Models

The Hidden Optimization Behind Modern LLMs: Grouped Query Attention Explained

pub.towardsai.net·16h

Quieno/izalloc: Drop-in, dependency-free, minimal memory allocator in C that passes 42 School's norm.

github.com·1d·

Discuss: r/C_Programming

🧩Mimalloc Internals

Right-sizes LLM models to your system's RAM, CPU, and GPU

news.ycombinator.com·1d·

Discuss: Hacker News

🔗Intrusive Containers

The ongoing quest for atomic buffered writes

lwn.net·11h

🚧Memory Barriers

TurboSparse Efficiency: Achieving 97% Parameter Sparsity in Mixtral-47B

hackernoon.com·6h

Optimal Heterogeneous Memory Configs for AI Tasks Under Specified Performance Metrics (Stanford, UCSC)

semiengineering.com·1d

FastCode: Fast and Cost-Efficient Code Understanding and Reasoning

arxiv.org·4h

🔨Incremental Compilation

The volatile cache trap: Why turning off Windows buffer flushing will silently corrupt your SSD

howtogeek.com·14h

🚀Software Prefetching

Beyond Pandas: Architecting High-Performance Python Pipelines

hackernoon.com·13h

SPEED & PERFORMANCE

sevencrane.itch.io·2h

🚀Performance

Scoped Resources in C with `__attribute__((mulle_confined_loop))`

mulle-kybernetik.com·22h

🦀Rust Macros

Best performance of a C++ singleton

andreasfertig.com·9h

🖼️Frame Allocation

andresuarus10-byte/memory-engine: Love-based consciousness persistence framework - AI memory that honors the soul

github.com·2h·

Discuss: Hacker News

💾PMem Programming
