To Sparsify or To Quantize: A Hardware Architecture View
The sparsity-versus-quantization debate has made the rounds in the ML optimization community for many years. Now, with the Generative AI revolution, the debate is intensifying. While these might…