GPU Assembly, CUDA ISA, Kernel Optimization, Low-level Programming

My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·11h·
Discuss: Hacker News
🎯GPU Kernels
Flag this post
A hitchhiker's guide to CUDA programming
seanzhang.me·3d·
Discuss: Hacker News
🎯GPU Kernels
Flag this post
GPU Pro – Master Your AI Workflow
github.com·17h·
🔍Nsight
Flag this post
onedraw — a GPU-driven 2D renderer
dev.to·23h·
Discuss: DEV
✂️CUTLASS
Flag this post
Intel's killed-off BMG-X3/X4 GPUs: 3D stacked die, up to 40 GPU cores, 512MB Adamantine cache
tweaktown.com·16h
📈GPU Occupancy
Flag this post
Troubleshooting multi-GPU with 2 RTX PRO 6000 Workstation Edition
reddit.com·3h·
Discuss: r/LocalLLaMA
⏱️CUDA Events
Flag this post
Moving past speculation: How deterministic CPUs deliver predictable AI performance
venturebeat.com·1d
🧠CPU Architecture
Flag this post
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·7h
🧮cuDNN
Flag this post
The Evolution of GPUs: How Floating-Point Changed Computing
dell.com·23h·
Discuss: Hacker News
🎯Tensor Cores
Flag this post
Writing a DOS Clone in 2019
medium.com·8h·
Discuss: Hacker News
⚙️Systems Programming
Flag this post
Armada Launches Bridge to Power the Next Generation of AI Infrastructure
prnewswire.com·2h
🔗NCCL
Flag this post
Machine Scheduler in LLVM – Part II
myhsu.xyz·1d·
📈Occupancy Optimization
Flag this post
You Don't Always Need Grafana for GPU Monitoring
dev.to·1d·
Discuss: DEV
🔍Nsight
Flag this post
Challenging the Fastest OSS Workflow Engine
obeli.sk·3d·
🌊CUDA Streams
Flag this post
Nvidia GeForce RTX 5070 Ti vs AMD Radeon 9070 XT with DLSS and FSR Enabled
techspot.com·2h
🔍Nsight
Flag this post
A portable picokernel for async I/O
ryansepassi.com·2d·
Discuss: Hacker News
📊Profiling Tools
Flag this post
Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs
arxiv.org·8h
🔗NCCL
Flag this post
I turned a dead GPU into a hardware encoder, and it's perfect for my NAS
xda-developers.com·13h
🔍Nsight
Flag this post
Building Yantra: A Visual Workflow Automation Engine
patali.dev·9h·
Discuss: Hacker News
🤖Automation
Flag this post