Occupancy Math on the AMD MI355X: A From-First-Principles Guide (opens in new tab)
A from-first-principles guide to wavefront occupancy on AMD's MI355X (CDNA4): the hardware resource budget, the four limiters that cap it, worked MXFP8 GEMM examples, and why peak throughput often lives at low occupancy.
Read the original article