AQ4SViT: An Automated Quantization Framework with Search Gating Policy for Compressing Spiking Vision Transformers (opens in new tab)

Spiking Vision Transformers (SViTs) have emerged as alternative low-power ViT models, but their large sizes hinder their deployments on resource-constrained embedded AI systems. To address this, state-of-the-art works proposed quantization techniques to compress SViT models, but their manual, human-guided approach needs a huge design time and power/energy consumption to find the appropriate quantization setting for each given network, making thi...

Read the original article