Hi,
I am working with tfmot to quantize some specific layers in my model, so not all of my layers are annotated, only the specific ones I want quantized.

My first question is about this code: https://github.com/tensorflow/model-optimization/blob/e38d886935c9e2004f72522bf11573d43f46b383/tensorflow_model_optimization/python/core/quantization/keras/quantize.py. I do not understand why the check there is `not isinstance` rather than `isinstance`. I would have expected that with `isinstance` the layer is of type `QuantizeAnnotate` and gets pushed to `requires_output_quantize`, as the name suggests.

Coming to `_quantize` now (same file, same commit): why do we need to verify that the layer is not in `requires_output_quantize`, given that this variable holds layers that do not need to be quantized? I am confused, and this sounds contradictory to me.
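For context, here is a minimal sketch of the kind of setup I mean. This is a toy model, not my actual network; the layer sizes are made up, and only the middle Dense layer is annotated:

```python
import tensorflow as tf
import tensorflow_model_optimization as tfmot

quantize_annotate_layer = tfmot.quantization.keras.quantize_annotate_layer
quantize_apply = tfmot.quantization.keras.quantize_apply

# Toy model: only the middle Dense layer is annotated for quantization.
# As far as I understand, its unannotated inbound layer is what ends up
# in requires_output_quantize inside quantize_apply.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(32, activation="relu", input_shape=(16,)),
    quantize_annotate_layer(tf.keras.layers.Dense(16, activation="relu")),
    tf.keras.layers.Dense(10),
])

quantized_model = quantize_apply(model)
quantized_model.summary()
```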
NB: I have re-implemented the _quantize() function like so:
```python
def _quantize(layer):  # pylint: disable=missing-docstring
    if (
        (layer.name not in layer_quantize_map)
        or (isinstance(layer, quantize_wrapper.QuantizeWrapper))
        or issubclass(type(layer), QuantizeLayer)
    ):
        # Supports custom QuantizeWrapper subclasses.
        print(f"Layer is {layer.__class__}")
        return layer

    if layer.name in requires_output_quantize:
        if not quantize_registry.supports(layer):
            return layer
        full_quantize_config = quantize_registry.get_quantize_config(layer)
        if not full_quantize_config:
            return layer
        quantize_config = qat_conf.OutputOnlyConfig(full_quantize_config)
    else:
        quantize_config = layer_quantize_map[layer.name].get("quantize_config")
        if not quantize_config and quantize_registry.supports(layer):
            quantize_config = quantize_registry.get_quantize_config(layer)
        if not quantize_config:
            error_msg = (
                "Layer {}:{} is not supported. You can quantize this "
                "layer by passing a `tfmot.quantization.tf.keras.QuantizeConfig` "
                "instance to the `quantize_annotate_layer` API."
            )
            raise RuntimeError(
                error_msg.format(layer.name, layer.__class__, quantize_registry.__class__)
            )

    quantize_config = copy.deepcopy(quantize_config)
    return quantize_wrapper.QuantizeWrapperV2(layer, quantize_config)
```
Compared to the original, I removed the `layer.name not in requires_output_quantize` condition from the first if statement, and it does work for me. But I still do not understand why, or how, this could work in general.
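For what it is worth, this is roughly how I checked the result of my modified quantize_apply (a small sketch; `quantized_model` refers to the output on the toy model above):

```python
from tensorflow_model_optimization.python.core.quantization.keras import quantize_wrapper

# List which layers actually ended up wrapped for quantization.
# QuantizeWrapperV2 subclasses QuantizeWrapper, so checking the base class
# catches both.
for layer in quantized_model.layers:
    wrapped = isinstance(layer, quantize_wrapper.QuantizeWrapper)
    print(f"{layer.name}: {'wrapped' if wrapped else 'not wrapped'}")
```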
Thanks,