Packing of pruned model with 65% sparsity

We have designed a custom packer to pack the sparsity in pruned models for specific target hardware. After pruning, when the model is exported to .h5 format, the model size drops significantly.

  1. Why is this the case, given that the number of weights is still the same (though 65% of them are 0s) and the precision is float32?
  2. Is there any compression inherently happening when the data is stored in .h5 format?
  3. If some compression/sparsity packing is happening while saving the model to .h5, how can we stop it so the model retains its original size?
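For reference, here is a minimal sketch of how we check the exported model (`pruned_model.h5` is a placeholder path; we assume the model was stripped with `tfmot.sparsity.keras.strip_pruning` before export):

```python
import numpy as np
import tensorflow as tf

# Load the exported pruned model and inspect its weights.
model = tf.keras.models.load_model("pruned_model.h5")  # placeholder path
weights = np.concatenate([w.flatten() for w in model.get_weights()])

print("dtype:", weights.dtype)                       # still float32
print("sparsity:", float(np.mean(weights == 0.0)))   # ~0.65 after pruning
```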

Thanks in advance. :blush:

Hi @Swaraj_Badhei

Sorry for the delayed response. You may have figured this out by now, but here are a few more pointers. Your observation is correct: when a pruned model is exported to .h5 format, the model size is reduced, and this is intended.

  • A pruned model typically contains a large number of zero weights, so the non-zero weights, along with their indices, need to be stored efficiently; this is where a sparse matrix representation plays an important role. Pruned models are stored in a sparse matrix representation, unlike other TF or Keras models, which are stored in dense format (see the first sketch after this list).

  • No explicit compression takes place; the pruning itself reduces the model size.

  • To retain the original size of your custom model, you can manually save the model weights to .npy format (see the second sketch after this list).
    Please follow the link for further information
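To illustrate the sparse-representation point from the first bullet, here is a minimal sketch using a `scipy.sparse` CSR matrix (this only illustrates the storage idea, not the exact on-disk format Keras uses):

```python
import numpy as np
from scipy import sparse

# Build a dense float32 weight matrix with ~65% zeros,
# similar to what a pruned layer produces.
rng = np.random.default_rng(0)
w = rng.normal(size=(256, 256)).astype(np.float32)
w[rng.random(w.shape) < 0.65] = 0.0

# CSR keeps only the non-zero values plus their column indices and
# row pointers, so storage scales with the number of non-zeros.
w_csr = sparse.csr_matrix(w)

dense_bytes = w.nbytes
csr_bytes = w_csr.data.nbytes + w_csr.indices.nbytes + w_csr.indptr.nbytes
print(f"dense: {dense_bytes} bytes, CSR: {csr_bytes} bytes")
```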
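And a sketch of the manual .npy export from the last bullet (`pruned_model.h5` is a placeholder path; `np.save` writes the dense arrays in full, zeros included, so the file keeps the original dense footprint):

```python
import numpy as np
import tensorflow as tf

model = tf.keras.models.load_model("pruned_model.h5")  # placeholder path

# get_weights() returns a list of dense numpy arrays with differing
# shapes, so wrap them in an object array before saving as .npy.
weights = model.get_weights()
arr = np.empty(len(weights), dtype=object)
arr[:] = weights
np.save("pruned_weights.npy", arr, allow_pickle=True)
```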

Thank you