I’m moving the discussion of this issue here.
Briefly, I’m trying to convert a TensorFlow model to TFLite. The model maintains a non-trainable state vector that is updated at every inference, similar to an RNN hidden state. My issue is that during conversion this state vector doesn’t seem to be interpreted as a quantized value, so the converted network inserts quantize and dequantize operations every time it reads from or writes to it.
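To illustrate the pattern, here is a minimal sketch of what I mean by a stateful model (names, shapes, and the computation are made up for illustration; the actual model is in the gist):

```python
import tensorflow as tf

class StatefulModel(tf.Module):
    """Toy model that keeps a non-trainable state vector, updated at every inference."""

    def __init__(self, state_size=16):
        super().__init__()
        # Persistent state, analogous to an RNN hidden state.
        self.state = tf.Variable(tf.zeros([state_size]), trainable=False, name="state")
        self.kernel = tf.Variable(tf.random.normal([state_size, state_size]), name="kernel")

    @tf.function(input_signature=[tf.TensorSpec([1, 16], tf.float32)])
    def __call__(self, x):
        # Read the state, mix it with the input, and write the result back.
        new_state = tf.tanh(tf.matmul(x, self.kernel))[0] + self.state
        self.state.assign(new_state)
        return new_state
```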
The model and the corresponding Netron graph are shown in the referenced GitHub issue, and both can be reproduced with this gist.
Ultimately, I want to deploy this on a Coral EdgeTPU, so I’d like to eliminate unnecessary ops such as these quantize/dequantize blocks. This should be possible, since the state vector should be int8 rather than float32 in the TFLite model. How can I achieve that?
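For context, this is the kind of full-integer conversion I have in mind (a sketch continuing the `StatefulModel` example above, with a placeholder representative dataset, not the exact script from the gist):

```python
import numpy as np
import tensorflow as tf

# `StatefulModel` is the sketch from the previous snippet.
model = StatefulModel()
converter = tf.lite.TFLiteConverter.from_concrete_functions(
    [model.__call__.get_concrete_function()], model
)

def representative_dataset():
    # Placeholder calibration data; real calibration would use representative inputs.
    for _ in range(100):
        yield [np.random.rand(1, 16).astype(np.float32)]

converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.int8
converter.inference_output_type = tf.int8
tflite_model = converter.convert()
```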