My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.ioยท1dยท
Discuss: Hacker News
๐ŸŽฏGPU Kernels
Flag this post
How NVIDIA GeForce RTX GPUs Power Modern Creative Workflows
blogs.nvidia.comยท4h
โฑ๏ธCUDA Events
Flag this post
Troubleshooting multi-GPU with 2 RTX PRO 6000 Workstation Edition
reddit.comยท1dยท
Discuss: r/LocalLLaMA
โฑ๏ธCUDA Events
Flag this post
Cutting LLM Batch Inference Time in Half: Dynamic Prefix Bucketing at Scale
daft.aiยท1hยท
Discuss: Hacker News
๐Ÿ”—NCCL
Flag this post
GPU Pro โ€“ Master Your AI Workflow
github.comยท1dยท
๐Ÿ”Nsight
Flag this post
Giga Computing Announces Worldwide Availability of Its NVIDIA RTX PRO Server
prnewswire.comยท42m
๐Ÿ”Nsight
Flag this post
Inline vs. Pipeline Ray Tracing
evolvebenchmark.comยท4hยท
Discuss: Hacker News
โฑ๏ธCUDA Events
Flag this post
Why stop at 1 million tokens when you can have 10? My journey to extreme context on a gaming GPU. [P]
reddit.comยท7hยท
๐ŸŽ๏ธTensorRT
Flag this post
Tetris: An SLA-aware Application Placement Strategy in the Edge-Cloud Continuum
arxiv.orgยท13h
๐ŸŒDistributed Computing
Flag this post
PCIe lanes are the real currency of modern PCs
xda-developers.comยท1d
โฑ๏ธCUDA Events
Flag this post
eBPF Tutorial by Example: Monitoring GPU Driver Activity with Kernel Tracepoints
dev.toยท11hยท
Discuss: DEV
โฑ๏ธCUDA Events
Flag this post
Small form factor, big impact: Solving edge computingโ€™s space and performance paradox
nordot.appยท1d
๐ŸŒDistributed Computing
Flag this post
Planning a Multi-Monitor Setup with KVM โ€“ Advice Needed
amazon.deยท2hยท
Discuss: r/homelab
โฑ๏ธCUDA Events
Flag this post
Co-Simulation Framework for Parallel DNN Execution on Chiplet-Based Systems (UWโ€“Madison, Washington State)
semiengineering.comยท21h
๐ŸŒŠCUDA Streams
Flag this post
Voxel Grid Visibility
cod.ifies.comยท2hยท
โœ‚๏ธCUTLASS
Flag this post
7 ways networking powers your AI workloads on Google Cloud
cloud.google.comยท1h
โšกONNX Runtime
Flag this post
Arista Modular Switches Aim At Scale Across Networks, Hit Scale Out, Too
nextplatform.comยท1h
๐ŸŒŠCUDA Streams
Flag this post
Synopsys and NVIDIA Forge AI Powered Future for Chip Design and Multiphysics Simulation
semiwiki.comยท1d
๐ŸŒŠCUDA Streams
Flag this post
Lazy loading isn't the magic pill to fix AI Inference
tensorfuse-docs.mintlify.devยท4hยท
Discuss: Hacker News
๐Ÿš€MLOps
Flag this post