My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.ioยท1dยท
Discuss: Hacker News
๐ŸŽฏGPU Kernels
Flag this post
Troubleshooting multi-GPU with 2 RTX PRO 6000 Workstation Edition
reddit.comยท1dยท
Discuss: r/LocalLLaMA
โฑ๏ธCUDA Events
Flag this post
GPU Pro โ€“ Master Your AI Workflow
github.comยท1dยท
๐Ÿ”Nsight
Flag this post
DCcluster-Opt: Benchmarking Dynamic Multi-Objective Optimization for Geo-Distributed Data Center Workloads
arxiv.orgยท4h
๐Ÿ”—NCCL
Flag this post
PCIe lanes are the real currency of modern PCs
xda-developers.comยท1d
โฑ๏ธCUDA Events
Flag this post
eBPF Tutorial by Example: Monitoring GPU Driver Activity with Kernel Tracepoints
dev.toยท2hยท
Discuss: DEV
โฑ๏ธCUDA Events
Flag this post
Synopsys and NVIDIA Forge AI Powered Future for Chip Design and Multiphysics Simulation
semiwiki.comยท19h
๐ŸŒŠCUDA Streams
Flag this post
Small form factor, big impact: Solving edge computingโ€™s space and performance paradox
nordot.appยท18h
๐ŸŒDistributed Computing
Flag this post
Context Engineering for Agents
pub.towardsai.netยท1d
๐Ÿค–AI Coding Tools
Flag this post
Co-Simulation Framework for Parallel DNN Execution on Chiplet-Based Systems (UWโ€“Madison, Washington State)
semiengineering.comยท13h
๐ŸŒŠCUDA Streams
Flag this post
Attention Is All You Need for KV Cache in Diffusion LLMs
paperium.netยท5hยท
Discuss: DEV
๐ŸŽฏTensor Cores
Flag this post
Evolving Ray and Kubernetes together for the future of distributed AI and ML
cloud.google.comยท16h
๐ŸŒDistributed Computing
Flag this post
A hitchhiker's guide to CUDA programming
seanzhang.meยท4dยท
Discuss: Hacker News
๐ŸŽฏGPU Kernels
Flag this post
Where to Buy or Rent GPUs for LLM Inference: The 2026 GPU Procurement Guide
bentoml.comยท3dยท
Discuss: Hacker News
๐ŸŽฏGPU Kernels
Flag this post
A behind-the-scenes look at Broadcomโ€™s design labs
techbrew.comยท12hยท
Discuss: Hacker News
โฑ๏ธCUDA Events
Flag this post
AMD confirms its separate drivers for RDNA 1/2 and RDNA 3/4 GPUs will roll out at the same time
tweaktown.comยท4h
โฑ๏ธCUDA Events
Flag this post
The PVS system in our 3d metroid-like
reddit.comยท8hยท
Discuss: r/gamedev
๐Ÿ”งPTX
Flag this post
Uncrossed Multiflows and Applications to Disjoint Paths
arxiv.orgยท4h
๐Ÿ“ŠCUDA Graphs
Flag this post
flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R
arxiv.orgยท4h
๐Ÿ”„ONNX
Flag this post
Replacing my old desktop, a high-end Linux PC
boyter.orgยท1d
๐Ÿ”งPTX
Flag this post