🎯 GPU Kernels - miterion · Scour

Learn the shader techniques that set you apart in this course

fragments.supply·1d

High-performance GPU implementation of Wannier interpolation of the electron-phonon interaction for transport properties

link.aps.org·2d

⏱️CUDA Events

Adventures in Neural Rendering

interplayoflight.wordpress.com·1d·

Discuss: Hacker News

📊Gradient Accumulation

Lucene HNSW performance: A deep dive into the OS page cache

opensearch.org·2d

📊Profiling Tools

NVIDIA GeForce NOW Is Coming to India and Your Phone Is About to Do More than Just Run BGMI

beebom.com·2d

New microkernel OS in 10 days: From zero to Google Compute Engine

seiya.me·3d·

Discuss: Hacker News

⚙️Systems Programming

laphilosophia/strime: Streaming projection engine — extract fields at multi-gigabit speeds with O(1) memory

github.com·1d·

Discuss: r/node

⚡Flash Attention

An in-kernel machine-learning library

lwn.net·5d

🔗Kernel Fusion

‘India’s bet on smaller AI models may overlook CPUs’: Ziroh Labs CEO Hrishikesh Dewan

indianexpress.com·2d

⚡ONNX Runtime

How PCIe, NVLink, and NUMA Topology Affect GPU Scheduling Outcomes

dev.to·3d·

Discuss: DEV

📊CUDA Graphs

Unpopular Opinion: Jensen Huang Is Making Nvidia Its Own Worst Enemy

finance.yahoo.com·3d

Anubis OSS — Local LLM Benchmarking for Apple Silicon

devpadapp.com·2d·

Discuss: r/opensource

📊Profiling Tools

ZipFlow: a Compiler-based Framework to Unleash Compressed Data Movement for Modern GPUs

arxiv.org·2d

🌊CUDA Streams

GamingOnLinux - Vulkan-based translation layer D7VK officially expands to include Direct3D 5 support

store.steampowered.com·3d

📈GPU Occupancy

Linux Kernel 6.19 Officially Released, This Is What’s New

news.tuxmachines.org·1d

⏱️CUDA Events

How GPU Cloud Providers Handle Long-Tail Job Backlogs

acecloud.ai·3d·

Discuss: DEV

The RAM shortage finally convinced me to learn memory overclocking

xda-developers.com·7h

📈Occupancy Optimization

Leading Inference Providers Cut AI Costs by up to 10x With Open Source Models on NVIDIA Blackwell

blogs.nvidia.com·4h

🏎️TensorRT

Life support build: Breaking all the rules to build a productivity PC beast

tomshardware.com

·2d

🏗️Build Systems

GeForce RTX 6090 in 2028 at the earliest: When memory shortages dictate Nvidia's roadmap

igorslab.de·3d

⏱️CUDA Events

Loading more...