📈 Performance Profiling - widget101 · Scour

Anubis OSS — Local LLM Benchmarking for Apple Silicon

devpadapp.com·3d·

Discuss: r/opensource

📊Columnar Engines

The Performance Paradox: When Doing Less Work Makes Your Code Slower

dev.to·5d·

Discuss: DEV

🔍Memory Profilers

Show HN: GPU ROI simulator based on token usage and model architecture

axiomos.ai·2d·

Discuss: Hacker News

📊Columnar Engines

LLM Performance in Astro, React, Tailwind and Cloudflare

10xbench.ai·2d·

Discuss: Hacker News

Compiling the Vision Encoder: Squeezing 3% More Throughput from Qwen3-VL on Hopper GPUs

github.com·4d·

Discuss: DEV

📊Columnar Engines

AI in Multiple GPUs: Understanding the Host and Device Paradigm

towardsdatascience.com·17h

🏗️Hardware Architecture

Hitting 1,000 tokens per second on a single RTX 5090

blog.alpindale.net·4d·

Discuss: Hacker News, Hacker News

💾Cache Optimization

Minimum Energy Per Query

semiengineering.com·22h

💾Cache Optimization

Concurrent vs. Parallel Execution in LLM API Calls: From an AI Engineer’s Perspective

pub.towardsai.net·4d

🔄Concurrency

Boosting LCP: A Guide to fetchpriority="high"

dev.to·6d·

Discuss: DEV

💾Cache Optimization

Guney-olu/nanoslg: A from-scratch implementation of distributed LLM inference in simple readable Python

github.com·3d·

Discuss: Hacker News, r/LLM

📊Columnar Engines

Evolving our real-time timeseries storage again: Built in Rust for performance at scale

datadoghq.com·4d

🏛️Lakehouse Architecture

Heterogeneous Processing: A Strategy for Augmenting Moore's Law (2006)

linuxjournal.com·4d·

Discuss: Hacker News

⚡SIMD Optimization

When Bigger Instances Don’t Scale

scylladb.com·2d·

Discuss: r/programming

👁️Observability

Optimizing the MongoDB Java Driver: How minor optimizations led to macro gains

linkedin.com·1d·

Discuss: DEV

💾Cache Optimization

Container Timing: measuring web components performance

blogs.igalia.com·2d·

Discuss: Hacker News

👁️Observability

Comprehensive System-Level Performance Model For p-SRAM-Based IMC (USC, UW-Madison)

semiengineering.com·6d

🏗️Hardware Architecture

How AI coding makes developers 56% faster and 19% slower

thenewstack.io·3d

MiniMax M2.5: Game-Changer with 80% Coding Benchmark Score

news.reading.sh·10h·

Discuss: Hacker News

📊Columnar Engines

I struggled with system design until I learned these 114 concepts

newsletter.systemdesign.one

·5d

🏛️Lakehouse Architecture

Loading more...