⚡ Cache-Aware Algorithms - abnv · Scour

UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory

arxiv.org·1d

🧠Memory Allocators

Breaking the Tractability Barrier: A Generic Low-Level Solver for NP-Hard Instances (N=63) on Commodity 64-Bit Silicon

zenodo.org·1h·

Discuss: Hacker News

How caching helps in LLM Application?

dev.to·16h·

Discuss: DEV

🧠Memory Models

Execution-Centric Characterization of FP8 Matrix Cores, Asynchronous Execution, and Structured Sparsity on AMD MI300A

arxiv.org·1d

🎯CPU Dispatch

Supercharging Inference for AI Factories: KV Cache Offload as a Memory-Hierarchy Problem

blog.min.io·20h

🧠Memory Hierarchy

Building an Embedding API with Rust, Arm, and EmbeddingGemma on AWS Lambda

sobolev.substack.com·44m·

Discuss: Substack

📋JSON Parsing

Minimum Energy Per Query

semiengineering.com·1d

⏲️Embedded GC

How octorus Renders 300K Lines of Diff at High Speed

dev.to·4h·

Discuss: DEV

🌊Async Compilers

Scaling llama.cpp On Neoverse N2: Solving Cross-NUMA Performance Issues

semiengineering.com·1d

The Fourth Wave of Computing

lucibrowser.com·1h·

Discuss: Hacker News

🌱Green Threads

Intel Posts 2026 Update For Cache Aware Scheduling On Linux

phoronix.com·14h

💾Cache Algorithms

Cache-aware disaggregated inference for up to 40% faster long-context LLM serving

together.ai·2d·

Discuss: Hacker News, r/LocalLLaMA

⏲️Embedded GC

Optimizing the MongoDB Java Driver: How minor optimizations led to macro gains

linkedin.com·1d·

Discuss: DEV

⚡Interpreter Optimization

AI in Multiple GPUs: Understanding the Host and Device Paradigm

towardsdatascience.com·22h

🤝Cooperative Threading

Avoiding UB but "safe" data race in a lock-free slab allocator - help - The Rust Programming Language Forum

users.rust-lang.org·1d

🔒Rust Borrowing

C++20 matching engine - arena allocator, lock-free SPSC, intrusive linked lists, 255ns p50 latency

github.com·6h·

Discuss: r/cpp

🔢Algebraic Datatypes

Zero State Architecture deep dive

news.ycombinator.com·18h·

Discuss: Hacker News

📡Erlang BEAM

Best CPU 2026 – the top AMD Ryzen and Intel Core processors tested

club386.com·1h

🔀SIMD Programming

Performance Tip of the Week #62: Identifying and reducing memory bandwidth needs

abseil.io·5d

⚡Cache Optimization

Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy

venturebeat.com·13h

🗺️Region Inference

Loading more...