Quantization spec for 16x8 quantization

Hi all,

I am quantizing models to 16x8 bit precision, but I cannot find any information on the actual spec for the quantized ops. The 8-bit version has a nice overview of the spec here: LiteRT 8-bit quantization specification | Google AI Edge | Google AI for Developers

What I would like to know is whether there is some blanket format for all the ops in 16x8, e.g. 8-bit symmetric weights / 16-bit symmetric activations for every op, or 8-bit symmetric weights / 16-bit asymmetric activations, etc.

Would anyone know where to find this information? Thanks!

Hi @Jozef_C,

Welcome to the community, thanks for the post. Sorry for the late reply.

Yes, we do have a blanket format for the 16x8 mode (W8A16), just like the 8-bit industry-standard spec.
Let’s look at it in some detail: the 16x8 blanket specification covers weights, activations and biases, and it is followed exactly by most of the compute-heavy ops, namely Conv2D, DepthwiseConv, FullyConnected, etc.

The blanket specification you are probably looking for covers WEIGHTS, ACTIVATIONS and BIASES (a small inspection sketch follows right after this list).

Weights: int8 (8-bit signed integer), symmetric: the zero point is forced to 0 and the range is restricted to [-127, +127]. This restricted range helps avoid overflow issues in hardware accumulators.

Activations: int16 (inputs/outputs as 16-bit signed integers), symmetric: the zero point is forced to 0 (the 16x8 kernels require it), and values span the int16 range of -32768 to +32767.

Biases: int64, with zero point 0. This is different from the 8-bit spec (which uses int32 biases) and worth paying attention to: the 16-bit-by-8-bit products are accumulated in a wide accumulator, and 32 bits would not give enough headroom, so biases are stored as 64-bit integers to avoid truncation or overflow during accumulation and downscaling.
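If you want to confirm this on your own model, here is a minimal sketch (assuming you already have a model converted with the 16x8 ops set, saved at a hypothetical path model_16x8.tflite) that prints each tensor's dtype, scale and zero point through the standard tf.lite.Interpreter API:

```python
import tensorflow as tf

# Hypothetical path: any model converted with the experimental 16x8 ops set.
interpreter = tf.lite.Interpreter(model_path="model_16x8.tflite")
interpreter.allocate_tensors()

for detail in interpreter.get_tensor_details():
    q = detail["quantization_parameters"]
    # Expect int16 activations, int8 weights and int64 biases,
    # with zero points of 0 for the symmetric tensors.
    print(detail["name"], detail["dtype"].__name__,
          q["scales"][:1], q["zero_points"][:1])
```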

If you want reference documentation for the biases and the blanket 16x8 format discussed above, the links are below, followed by a short conversion sketch.
Note that 16x8 is still technically labeled as experimental in these documents.

~ official 16x8 bit guide - here you can see the int16 activation and int8 weight pairing.
link: https://ai.google.dev/edge/litert/conversion/tensorflow/quantization/post_training_integer_quant_16x8

~ detailed reference document for LiteRT optimisation methods - here you can see the activation and weight quantization mentioned.
link: https://ai.google.dev/edge/litert/conversion/tensorflow/quantization/post_training_quantization
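For completeness, here is a minimal conversion sketch based on the official 16x8 guide above. The saved-model path, input shape and random calibration data are placeholders for illustration; in practice you would feed a representative dataset drawn from your real inputs:

```python
import numpy as np
import tensorflow as tf

saved_model_dir = "path/to/saved_model"  # placeholder path

def representative_dataset():
    # Placeholder calibration data; substitute real samples with your model's input shape.
    for _ in range(100):
        yield [np.random.rand(1, 224, 224, 3).astype(np.float32)]

converter = tf.lite.TFLiteConverter.from_saved_model(saved_model_dir)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset
# Select the experimental 16x8 ops set: int16 activations, int8 weights.
converter.target_spec.supported_ops = [
    tf.lite.OpsSet.EXPERIMENTAL_TFLITE_BUILTINS_ACTIVATIONS_INT16_WEIGHTS_INT8
]

with open("model_16x8.tflite", "wb") as f:
    f.write(converter.convert())
```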

Since you are interested in this specification, you may also like this blog on how 16x8 is used to unlock very high performance on NPUs such as Qualcomm’s Snapdragon Elite: Unlocking Peak Performance on Qualcomm NPU with LiteRT - Google Developers Blog

That’s it. Feel free to reply with any further questions about 16x8 quantisation and the blanket format.

Keep us posted on Google AI for Developers Forum.
Thanks.