Intro to SIMD for 3D graphics
🧠CPU Architecture
Vectorizing for Fun and Performance
⚙️Compilers
12 AIE-ML kernel vectorization
hackster.io·3w
⚙️Compilers
The state of SIMD in Rust in 2025
🌐WebAssembly
Comparing images with AVX
🎨Computer Graphics
Matrix Multiplication in CUDA
🎨Computer Graphics
10-26-building-the-rope-operation-for-tensorrent-hardware at Clehaxze
clehaxze.tw·1w
📐Linear Algebra
Hashing multiple blobs with BLAKE3
iroh.computer·3w
⚙️Compilers
Multiple Rows Mixers and Hsilu - A Family of Linear Layers and A Permutation with Fewer XORs
eprint.iacr.org·2w
⚙️Compilers
NextSilicon Takes Aim At CPUs And GPUs With “Maverick-2” Dataflow Engine
🧠CPU Architecture
Crushing ML Latency: The (Un)Official Best Practices for Systems Optimisation
pub.towardsai.net·1d
⚙️Compilers
Learning Triton One Kernel at a Time: Matrix Multiplication
towardsdatascience.com·3w
🎨Computer Graphics
Geonum – geometric number library for unlimited dimensions with O(1) complexity
🔢Number Theory
Inference Acceleration from the Ground Up
semiwiki.com·1w
🧠CPU Architecture
Moving past speculation: How deterministic CPUs deliver predictable AI performance
venturebeat.com·4d
🧠CPU Architecture