Geonum – geometric number library for unlimited dimensions with O(1) complexity
github.com·5h·
Discuss: Hacker News
✂️CUTLASS
Flag this post
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·18h·
Discuss: Hacker News
🎯GPU Kernels
Flag this post
I'm a beginner at C and I would like feedback about the optimisation of my code
reddit.com·1d·
🔍Type Checkers
Flag this post
onedraw — a GPU-driven 2D renderer
dev.to·1d·
Discuss: DEV
✂️CUTLASS
Flag this post
The Evolution of GPUs: How Floating-Point Changed Computing
dell.com·1d·
Discuss: Hacker News
🎯Tensor Cores
Flag this post
Tetrahedral analog of the Pythagorean theorem
johndcook.com·3h
🔢cuBLAS
Flag this post
A hitchhiker's guide to CUDA programming
seanzhang.me·3d·
Discuss: Hacker News
🎯GPU Kernels
Flag this post
Can-t stop till you get enough
cant.bearblog.dev·1d·
Discuss: Hacker News
📜TorchScript
Flag this post
A fun application of Green’s functions and geometric algebra: Residue calculus
peeterjoot.com·15h
🔢cuBLAS
Flag this post
Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs
arxiv.org·14h
🔗NCCL
Flag this post
Troubleshooting multi-GPU with 2 RTX PRO 6000 Workstation Edition
reddit.com·10h·
Discuss: r/LocalLLaMA
⏱️CUDA Events
Flag this post
Intel's killed-off BMG-X3/X4 GPUs: 3D stacked die, up to 40 GPU cores, 512MB Adamantine cache
tweaktown.com·23h
🔧PTX
Flag this post
Get Ready for Clojure, GPU, and AI in 2026 with CUDA 13.0
dragan.rocks·4d·
Discuss: Hacker News
⏱️CUDA Events
Flag this post
How to Use Multimodal AI Models With Docker Model Runner
docker.com·6h
🔄ONNX
Flag this post
I made a basic arcade machine with this ESP32-powered display
xda-developers.com·8h
Flash Attention
Flag this post
I made a tensor runtime & inference framework in C (good for learning how inference works)
github.com·18h·
📜TorchScript
Flag this post
How to get the GOT address from a PLT stub using GDB
rafaelbeirigo.github.io·1d·
Discuss: Hacker News
📊Profiling Tools
Flag this post
Free Functions Don't Change Performance (Much)
16bpp.net·6h·
Discuss: Hacker News, r/cpp
📊Profiling Tools
Flag this post
Masked Softmax Layers in PyTorch
mcognetta.github.io·4h·
Discuss: Hacker News
🔥PyTorch
Flag this post
Deep Integration and the Convergence of Model Architecture and Hardware in AI
dev.to·23h·
Discuss: DEV
🎯Tensor Cores
Flag this post