⚡ Performance - Wazzaps · Scour

G.Skill explains how AMD EXPO ULL unlocks additional performance — expanded profiles allow memory makers to include subtiming tweaks for the first time

🧠Memory Allocators News

tomshardware.com

·

Records in Production: Where They Shine and Where They Silently Fail

🧠Memory Management

javacodegeeks.com·

Intel is turning the wrong clock: The Core Ultra 7 265K shows why Arrow Lake loses more at NGU than D2D can recover

🧠CPU Architecture

Apple WWDC On-Device AI Deep Dive - Google Docs

gist.is··Hacker News

HFT Latency Monitoring with Probabilistic Calling Context

⚙️Compilers

hftuniversity.com··Hacker News

ARTA: Adaptive Reinforcement-Learning-Based Throttling Agent for RowHammer Vulnerabilities

⏱️Tokio Academic

Elasticsearch simdvec deep-dive: Walking the memory tightrope to 2x better vector throughput

🧠CPU Architecture Blog

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

🤖AI Agents Blog

blogs.nvidia.com·

Two Leaps to 1000 Tokens/s on a 1T-Parameter Model: On Inference Systems, Execution Boundaries, and Co-Design

🤖AI Agents Blog

tilert.ai··Hacker News

Now available: Amazon EC2 M9g and M9gd instances powered by new AWS Graviton5 processors

🤖AI Agents Blog

aws.amazon.com··Hacker News

MLPerf and the rise of latency-aware LLM benchmarking

🧠AI Research

Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent

📱Edge AI Blog

dnhkng.github.io·

The Inference Alpha: Maximizing Frontier Models on AMD

📱Edge Computing Blog

digitalocean.com·

Why your database benchmarking data is probably wrong (and how I fixed mine)

⚙️Database Internals

developers.redhat.com·

bigattichouse/packed-twin-inference: PTI achieves ~2× throughput using a single quantized model (Q5_K_M or better) by running 4 generation streams in one batched decode call. The GPU loads model weights once per step and produces 4 predictions simultaneously. KV cache overhead is ~0.8 GiB total for all 4 streams. No draft model. No quality loss

🧠Memory Allocators Code

github.com··r/LocalLLaMA

SanDisk's massive 8TB SD cards are finally close to launch

🔐Hardware Security News

Tried to benchmark Google's new on-device dictation model and basically couldn't

getonit.ai··Hacker News

Benchmarking OpenZFS vs EXT4 for my NAS | Heitor's log

🏠Self-Hosting

heitorpb.github.io·

Massive AI Storage Demand Creates a New Memory Wall

📱Edge AI News

Why My Windows Benchmarks Were Lying — CPU Pinning, Power Caps, and What Variance Actually Tells You

🐧Linux News Blog

coloneltoad.substack.com··Substack

Log in to enable infinite scrolling