Convert tensorflow saved_model from float32 to float16

Rasoul · September 26, 2022, 1:49am

I have a TensorFlow SavedModel with float32 weights in a directory structure like the one below:

large_model\
    variables\
        variables.data-00000-of-00001   (3.6GB)
        variables.index
    saved_model.pb

I would like to cast all the weights to float16 in order to reduce the size of the model. I found the tf.cast() function, which can be applied to tensors, but there seems to be no equivalent for casting all the weights of a model at once. I guess I have to read the layers of the model one by one, manually cast each weight to float16, and then save the result using model.save(), but I don’t know how to do that.

Note1: I do NOT have access to the python code of the model definition, only the saved model.

Note2: I do NOT want to save it as tflite format.

dhruvkakadiya · October 9, 2022, 8:24am

@Rasoul did you try the steps here?

Rasoul · October 9, 2022, 11:11am

Well, I have clearly mentioned that I don’t want to use tflite.
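
For reference, the "read every weight and cast it manually" idea from the original question can be sketched roughly as below. This is only a minimal sketch under assumptions, not a confirmed solution from this thread: the helper names (read_checkpoint_variables, cast_float32_to_float16, total_nbytes) are hypothetical, and the checkpoint prefix "large_model/variables/variables" is inferred from the directory listing in the question. It reads the stored tensors with tf.train.load_checkpoint and casts only the float32 ones with NumPy. Note that it produces float16 arrays only; saved_model.pb still declares the variables as float32, so turning the result back into a loadable SavedModel would need additional work that is not shown here.

```python
import numpy as np


def read_checkpoint_variables(prefix):
    """Read every tensor stored under a checkpoint prefix into a dict.

    For the layout in the question the prefix would presumably be
    "large_model/variables/variables" (an assumption, not verified).
    """
    import tensorflow as tf  # imported lazily; only needed for reading

    reader = tf.train.load_checkpoint(prefix)
    return {name: reader.get_tensor(name)
            for name in reader.get_variable_to_shape_map()}


def cast_float32_to_float16(weights):
    """Return a new dict in which float32 arrays are cast to float16.

    Values of any other dtype (integers, strings, ...) are passed through
    unchanged, since only the float32 weights are meant to shrink.
    """
    out = {}
    for name, value in weights.items():
        arr = np.asarray(value)
        out[name] = arr.astype(np.float16) if arr.dtype == np.float32 else value
    return out


def total_nbytes(weights):
    """Sum the in-memory size of all values, in bytes."""
    return sum(np.asarray(v).nbytes for v in weights.values())


# Toy stand-in for real weights, so the sketch runs without a model on disk.
demo = {
    "dense/kernel": np.ones((4, 3), dtype=np.float32),
    "step": np.array(7, dtype=np.int64),
}
half = cast_float32_to_float16(demo)
print(total_nbytes(demo), "bytes ->", total_nbytes(half), "bytes")
```

With a real model, the dict passed to cast_float32_to_float16 would come from read_checkpoint_variables(...) instead of the toy demo. Keep in mind that float16 cannot represent magnitudes above roughly 65504, so unusually large weights would overflow to inf after the cast.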
