The latest Gemma 4 models use a training trick to slash their on-device memory footprint (opens in new tab)

You can now download Gemma 4 models with quantization-aware training to reduce the amount of mobile memory required to 1GB.