Quantization of a CNN with 4 bit

Hi, I need to quantize a small cnn. After the training I would like to see weights and bias quantized with 4 bit precision. I’m using Tensorflow model optimization but I always see floating point at the end like many other libraries. With Tensorflow lite I can see 8 bit precision for weights while bias remaining 32 bit.

Can you help me suggesting a way to solve this problem? Any help is welcome.

Thank you so much for your attention.

Hi @claudio137, At present 4 bit qutization is not available in tensorflow. Only int8, float16 are minimum quantization supported. Thank You.