Hi everyone, I am trying to quantize my model to 4-bit using post-training quantization. I see that TFLite supports 8-bit integer and 16-bit float quantization. How can I do post-training quantization to convert my 32-bit float model to a 4-bit integer model? Is it possible? Kindly share your views.
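For context, this is the int8 post-training quantization path I am referring to (a minimal sketch; the tiny Keras model and random calibration data here are just placeholders for my real float32 model and dataset):

```python
import numpy as np
import tensorflow as tf

# Placeholder model standing in for the real float32 model.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(8,)),
    tf.keras.layers.Dense(4, activation="relu"),
    tf.keras.layers.Dense(1),
])

def representative_data():
    # Representative samples used to calibrate activation ranges.
    # Random data here; real inputs should be used in practice.
    for _ in range(100):
        yield [np.random.rand(1, 8).astype(np.float32)]

converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_data
# Force full-integer quantization: int8 weights and activations.
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
tflite_model = converter.convert()  # serialized int8 model as bytes
```

I would like to do the equivalent of this but with 4-bit integers instead of 8-bit.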
Regards
Saras