Integer Quantization: Deep Dive (opens in new tab)
A lot has happened in transformer quantization over the past few years, from barely being able to quantize a 7B model in INT8 without...
Read the original articleA lot has happened in transformer quantization over the past few years, from barely being able to quantize a 7B model in INT8 without...
Read the original article