Why do we apply a standard compression algorithm (e.g. gzip) after pruning or weight clustering?
To use the model, we need to unzip it anyway, right?
So what difference does it make?
Hi Mohanish.
This is because the tflite model stores the pruned elements without changing their data type. But modern compression algorithms compress zero values very efficiently, so compression can reduce the model size.
Yes, the compression may need to be undone before execution. However, since tflite is designed for mobile/IoT devices, the model is usually compressed while being delivered to and stored on the device, so a reduced (compressed) model size is still meaningful.
Hope this helps; any further discussion is welcome!
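To see why pruning helps gzip even though the stored tensors keep the same data type and size, here is a minimal sketch (not from the thread; the sizes and the 90% sparsity level are illustrative assumptions) comparing gzip on a random "dense" weight buffer versus the same buffer with most values zeroed out, as pruning would leave it:

```python
import gzip
import random

random.seed(0)
SIZE = 100_000  # hypothetical weight tensor size in bytes

# "Dense" weights: essentially incompressible random bytes.
dense = bytes(random.getrandbits(8) for _ in range(SIZE))

# "Pruned" weights: same size and dtype, but ~90% of the values are
# zeroed out -- pruning writes zeros in place, it does not shrink
# the stored tensor.
pruned = bytes(b if random.random() > 0.9 else 0 for b in dense)

dense_gz = gzip.compress(dense)
pruned_gz = gzip.compress(pruned)

print("original size:     ", SIZE)
print("dense after gzip:  ", len(dense_gz))   # roughly no savings
print("pruned after gzip: ", len(pruned_gz))  # large savings
```

Both buffers are the same size on disk before compression; only the pruned one shrinks substantially after gzip, which is exactly the gap the tutorials are demonstrating.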
Hello @Rino_Lee ,
So, please correct my understanding of your reply.
Are the models compressed automatically by tflite when stored on a microcontroller,
or should it be done manually?
That’s not done by tflite. It should be done manually by the application developer.
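A manual flow could look like the sketch below (filenames and the dummy model bytes are hypothetical, for illustration only): compress the flatbuffer at build/packaging time, then decompress on the device before handing the raw bytes to the interpreter, which expects an uncompressed flatbuffer.

```python
import gzip
import shutil

MODEL = "pruned_model.tflite"  # hypothetical filename
PACKED = MODEL + ".gz"

# Stand-in bytes for a real flatbuffer so the sketch runs end to end.
with open(MODEL, "wb") as f:
    f.write(b"\x00" * 10_000)

# Build time: compress the model before shipping it to the device.
with open(MODEL, "rb") as src, gzip.open(PACKED, "wb") as dst:
    shutil.copyfileobj(src, dst)

# On the device / in the app: decompress before loading, e.g. with
# tf.lite.Interpreter(model_content=model_bytes).
with gzip.open(PACKED, "rb") as src:
    model_bytes = src.read()

print(len(model_bytes))  # same bytes as the original model
```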
But when converting the tflite model to a C array for deployment on microcontrollers, no zip is used.
So is the gzip step only for theoretical use/results, with no practical meaning?
I ask because I cannot find any practical flow for this.
An example of a practical flow is ARM’s “VELA” tooling for the U55 NPU. This post-processes tflite flatbuffers, compressing constant weight tensors (the U55 supports on-the-fly HW weight decompression).