⏩ SIMD - hello · Scour

Heterogeneous Processing: A Strategy for Augmenting Moore's Law (2006)

linuxjournal.com·1d·

Discuss: Hacker News

⚡Hardware Acceleration

Protean Compiler: An Agile Framework to Drive Fine-grain Phase Ordering

arxiv.org·10h

📊Profile-Guided Optimization

How Anam Achieved 250% Faster Inference Using Zymtrace Continuous GPU Profiling

zymtrace.com·13h

🎮SIMT Execution

Main Content || Math ∩ Programming

jeremykun.com·16h

Building Scalable AI Applications: Architecture Patterns That Actually Work

dev.to·1d·

Discuss: DEV

🎭Program Synthesis

Faster AI Training Unlocked With New System For Massive Language Models

quantumzeitgeist.com·1h

Vector Databases Explained: Architecture and System Design for AI Apps

dev.to·9h·

Discuss: DEV

🧮Vector Databases

Your VCL App: 4x to 11x Faster Math Performance with Elements

blogs.remobjects.com·53m·

Discuss: Hacker News

Concurrent vs. Parallel Execution in LLM API Calls: From an AI Engineer’s Perspective

pub.towardsai.net·9h

⏰Timely Dataflow

Implementing vector

accu.org·1d·

Discuss: r/cpp

📌Pin Projection

Dirk Eddelbuettel: chronometre: A new package (pair) demo for R and Python

dirk.eddelbuettel.com·21h

🐻‍❄️Polars

CUDA Guide: Workflow for Performance Tuning

digitalocean.com·4d

🎮SIMT Execution

How AI coding makes developers 56% faster and 19% slower

thenewstack.io·3h

🎭Program Synthesis

What should I program?

jamesmcm.github.io·1d

My Claude Code workflow

invertedpassion.com·2h

🔨Incremental Compilation

Writing a ONNX Neural Network Inference Engine from Scratch in C to run image classification with MobileNetV2

flexw.github.io·20h·

Discuss: r/C_Programming

Hitting 1,000 tokens per second on a single RTX 5090

blog.alpindale.net·16h·

Discuss: Hacker News

📍CPU Pinning

Quantized Tensor Train Compression For Turbulent Flow Simulation: O(log N) Scaling with Reynolds-Independent Bond Dimension

zenodo.org·2h·

Discuss: Hacker News

⏲️TimescaleDB Compression

Accelerate your discovery by parallelizing experiments

magellink.com·22h·

Discuss: Hacker News

More cores didn't improve my workflow

xda-developers.com·1d

🚀Performance

Loading more...