A hitchhiker's guide to CUDA programming
🎯GPU Kernels
Flag this post
A fast spectral overlapping domain decomposition method with discretization-independent conditioning bounds
arxiv.org·1d
✂️CUTLASS
Flag this post
Unlock Linear Solver Speed: Symbolic Preconditioning for Hyper-Performance
🎯Tensor Cores
Flag this post
Show HN: GPU-accelerated sandboxes for running AI coding agents in parallel [video]
🔗NCCL
Flag this post
The next RISC-V processor frontier: AI
edn.com·1d
🧠CPU Architecture
Flag this post
Building a Rules Engine from First Principles
towardsdatascience.com·2d
📉Model Quantization
Flag this post
Dynamical mean field theory for real materials on a quantum computer
nature.com·1d
🔗Kernel Fusion
Flag this post
Opportunistically Parallel Lambda Calculus
💡LSP
Flag this post
Vectorizing for Fun and Performance
🔄SIMD Programming
Flag this post
I made Matrix rain that turns your audio into colors - each voice/instrument paints a unique hue in real-time
⚡Flash Attention
Flag this post
I tested Arc Raiders across four GPUs of different ages — optimization still exists
xda-developers.com·2h
🔧PTX
Flag this post
Fungus: The Befunge CPU(2015)
⚙️Systems Programming
Flag this post
How to Convert Cubic Bézier Curves into Euler Spirals for GPU Optimization
hackernoon.com·2d
✂️CUTLASS
Flag this post
Loading...Loading more...