Comparing images with AVX
dev.to·6h
Discuss: DEV
✂️ CUTLASS
Moving past speculation: How deterministic CPUs deliver predictable AI performance
venturebeat.com·18h
🧠 CPU Architecture
Vectorizing for Fun and Performance
ibm.com·4d
Discuss: Hacker News
✂️ CUTLASS
How to get the GOT address from a PLT stub using GDB
rafaelbeirigo.github.io·7h
Discuss: Hacker News
📊 Profiling Tools
Cycle-accurate 6502 emulator as coroutine in Rust
github.com·1d
📊 Profiling Tools
A hitchhiker's guide to CUDA programming
seanzhang.me·3d
Discuss: Hacker News
🎯 GPU Kernels
Can-t stop till you get enough
cant.bearblog.dev·4h
Discuss: Hacker News
📜 TorchScript
I'm a beginner at C and I would like feedback about the optimisation of my code
reddit.com·9h
🔍 Type Checkers
A Practitioner's Guide to Kolmogorov-Arnold Networks
arxiviq.substack.com·5h
Discuss: Substack
📉 Model Quantization
The Evolution of GPUs: How Floating-Point Changed Computing
dell.com·9h
Discuss: Hacker News
🎯 Tensor Cores
The middle brother in classifier development: What is RandAugment?
openaccess.thecvf.com·11h
Discuss: DEV
📊 Gradient Accumulation
Some Fun Videos on Optimizing NES Code
bumbershootsoft.wordpress.com·1d
🚀 Compiler Optimization
Rasterizer Project - Part 3: Geometry
dev.to·1d
Discuss: DEV
✂️ CUTLASS
The next RISC-V processor frontier: AI
edn.com·2d
🧠 CPU Architecture
Optimizing Debian packages
grulic.org.ar·6h
📦 uv
Integer overflow checking with C23
blog.gnoack.org·3h
🔬 Static Analysis
Q&A #80 (2025-10-31)
computerenhance.com·1d
📊 Profiling Tools
Cure – Verification-First Programming for the Beam
cure-lang.org·5h
Discuss: Hacker News
📜 TorchScript
Qwen3 VL 30b a3b is pure love
reddit.com·3h
Discuss: r/LocalLLaMA
📉 Model Quantization
Intel's killed-off BMG-X3/X4 GPUs: 3D stacked die, up to 40 GPU cores, 512MB Adamantine cache
tweaktown.com·2h
🔧 PTX