CHIP8 โ€“ writing emulator, assembler, example game and VHDL hardware impl
blog.dominikrudnik.plยท11hยท
Discuss: Hacker News
โœ‚๏ธCUTLASS
Flag this post
Comparing images with AVX
dev.toยท1dยท
Discuss: DEV
โœ‚๏ธCUTLASS
Flag this post
Moving past speculation: How deterministic CPUs deliver predictable AI performance
venturebeat.comยท2d
๐Ÿง CPU Architecture
Flag this post
Geonum โ€“ geometric number library for unlimited dimensions with O(1) complexity
github.comยท18hยท
Discuss: Hacker News
โœ‚๏ธCUTLASS
Flag this post
Playing Around with ARM Assembly
blog.nobaralabs.comยท4hยท
Discuss: Hacker News
๐Ÿ“ŠProfiling Tools
Flag this post
Vectorizing for Fun and Performance
ibm.comยท5dยท
Discuss: Hacker News
โœ‚๏ธCUTLASS
Flag this post
H-FA: A Hybrid Floating-Point and Logarithmic Approach to Hardware Accelerated FlashAttention
arxiv.orgยท3h
โšกFlash Attention
Flag this post
Limitations of a two-pass assembler
boston.conman.orgยท5h
๐Ÿš€Compiler Optimization
Flag this post
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.ioยท1dยท
Discuss: Hacker News
๐ŸŽฏGPU Kernels
Flag this post
Don't let these 3 CPU specs trick you into paying more
xda-developers.comยท12h
โšกFlash Attention
Flag this post
Free Functions Don't Change Performance (Much)
16bpp.netยท18hยท
Discuss: Hacker News, r/cpp
๐Ÿ“ŠProfiling Tools
Flag this post
A hitchhiker's guide to CUDA programming
seanzhang.meยท4dยท
Discuss: Hacker News
๐ŸŽฏGPU Kernels
Flag this post
Programming for Computations: Matlab/Octave
link.springer.comยท1dยท
Discuss: Hacker News
๐ŸŒDistributed Computing
Flag this post
Low-Level Hacks
blog.raycursive.comยท6hยท
Discuss: Hacker News
๐Ÿ“ŠProfiling Tools
Flag this post
Cycle-accurate 6502 emulator as coroutine in Rust
github.comยท2dยท
๐Ÿ“ŠProfiling Tools
Flag this post
Real-time stock volatility prediction with deep learning on a time-series DB
medium.comยท11mยท
Discuss: Hacker News
โšกONNX Runtime
Flag this post
More Evidence for AVX10 and APX Support in Intel "Nova Lake" Emerge
techpowerup.comยท1d
๐Ÿง CPU Architecture
Flag this post
Disciplined Biconvex Programming
arxiv.orgยท3h
๐Ÿ“‰Model Quantization
Flag this post
Big-O Notation: Explained in 8 Minutes
blog.algomaster.ioยท5h
๐Ÿš€Compiler Optimization
Flag this post
Can-t stop till you get enough
cant.bearblog.devยท1dยท
Discuss: Hacker News
๐Ÿ“œTorchScript
Flag this post