GPU Profiling, CUDA Debugging, Performance Analysis, Trace

GPU Pro – Master Your AI Workflow
github.com·1d·
⏱️CUDA Events
Flag this post
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·18h·
Discuss: Hacker News
🎯GPU Kernels
Flag this post
Glances 4.4.0: System monitor gets Python API and Neofetch mode
heise.de·54m
📦uv
Flag this post
You Don't Always Need Grafana for GPU Monitoring
dev.to·1d·
Discuss: DEV
⏱️CUDA Events
Flag this post
FOCUS: Efficient Keyframe Selection for Long Video Understanding
arxiv.org·15h
👁️Attention Optimization
Flag this post
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·14h
🧮cuDNN
Flag this post
AMD releases statement about new game support for older Radeon GPUs
tweaktown.com·12h
🎮NVIDIA
Flag this post
Troubleshooting multi-GPU with 2 RTX PRO 6000 Workstation Edition
reddit.com·10h·
Discuss: r/LocalLLaMA
⏱️CUDA Events
Flag this post
FSWatcher: A new cross platform file watcher for MacOS, Linux and Windows
reddit.com·8h·
Discuss: r/golang
📊Profiling Tools
Flag this post
Nvidia GeForce RTX 5070 Ti vs AMD Radeon 9070 XT with DLSS and FSR Enabled
techspot.com·9h
📈GPU Occupancy
Flag this post
In the Foundry of Imagination: The Forged Studio Story
tympanus.net·3h
Flash Attention
Flag this post
Show HN: a Rust ray tracer that runs on any GPU – even in the browser
github.com·6h·
Discuss: Hacker News
⏱️CUDA Events
Flag this post
Synopsys and NVIDIA Forge AI Powered Future for Chip Design and Multiphysics Simulation
semiwiki.com·6h
🌊CUDA Streams
Flag this post
onedraw — a GPU-driven 2D renderer
dev.to·1d·
Discuss: DEV
✂️CUTLASS
Flag this post
Faster root cause for slow traces with ClickStack Event Deltas
clickhouse.com·4h
⏱️CUDA Events
Flag this post
How Data 360 Vector Search Delivers Near Real-Time Intelligence on 90% of Enterprise Data
engineering.salesforce.com·4h
🤖AI Coding Tools
Flag this post
AI-Deployable STM32N6 Open-Source Edge AI Camera
hackster.io·12h
🧮cuDNN
Flag this post
A more native experience for Cloud TPUs with Ray on GKE
cloud.google.com·3h
🚀MLOps
Flag this post
T3 with 1.2A driver and CUN66A1G + comparisons + behind the scenes!
old.reddit.com·3h·
Discuss: r/flashlight
⏱️Benchmarking
Flag this post
I repurposed my old GPU for self-hosted AI and it changed my life
xda-developers.com·4h
🤖AI Coding Tools
Flag this post