⚡ Performance Engineering - SRv6d · Scour

Defeat the Heap: Zero-Copy Data Movement in AXI4MLIR

🔄Compiler Design Academic

arxiv.org··Hacker News

ModPageSpeed 2.0: Lighthouse 56 to 90. On your own servers

🚢DevOps Discussion

modpagespeed.com··Hacker News

Apple WWDC On-Device AI Deep Dive - Google Docs

🔄Compiler Design

gist.is··Hacker News

Two Leaps to 1000 Tokens/s on a 1T-Parameter Model: On Inference Systems, Execution Boundaries, and Co-Design

🧠Computer Architecture Blog

tilert.ai··Hacker News

HFT Latency Monitoring with Probabilistic Calling Context

👁️Observability

hftuniversity.com··Hacker News

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

🧠Computer Architecture Code

github.com··Hacker News

ClaudeHeads

🔄Compiler Design Blog

fknil.pages.dev··Lobsters

Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB

🧠Computer Architecture Blog

ziraph.com··Hacker News

DiffusionGemma: 4x Faster Text Generation

🧠Computer Architecture News Blog

blog.google··Hacker News, r/LocalLLaMA, r/singularity

The perils of UUID primary keys in SQLite

🗄Database Systems

andersmurphy.com··Lobsters, Hacker News, r/programming

Now available: Amazon EC2 M9g and M9gd instances powered by new AWS Graviton5 processors

✅Formal Verification Blog

aws.amazon.com··Hacker News

A cute little trick to running classic IIR filters on the GPU

🧠Computer Architecture Blog

themaister.net··Hacker News

Global memory shortage throws wrench into IT pros’ budgets, planning

🧠Computer Architecture News

itbrew.com··Hacker News

Launch HN: General Instinct (YC P26) – Frontier models on edge devices

⚙️Engineering Discussion

news.ycombinator.com··Hacker News

aussiealex/agentmeter: Know what your agents cost. Cost intelligence for AI coding agents.

🌐HTMX Code

github.com··Hacker News

On-device AI is a margin decision

🧠Computer Architecture Blog

ziraph.com··Hacker News

The Road to Component Model 1.0

📦WebAssembly

bytecodealliance.org··Hacker News

Full Context on a Vulkan-Only Strix Halo: The Decode-Drop Reproduces, but the Sweet Spot Moves

🧠Computer Architecture

thefrontierlab.ai··Hacker News

How We Ditched Postgres for ClickHouse to Process 12 Billion Caches Per Day

🗄Database Systems Blog

momentic.ai··Hacker News

The economics of speculative decoding

🧠Computer Architecture Blog

fergusfinn.com··Hacker News

Log in to enable infinite scrolling