Changing augmentation parameters in TFOD API

Sayak_Paul · January 7, 2022, 10:53am

I am aware of the preprocessing proto that is used in the models repo:

tensorflow/models/blob/master/research/object_detection/protos/preprocessor.proto

syntax = "proto2";

package object_detection.protos;

// Message for defining a preprocessing operation on input data.
// See: //third_party/tensorflow_models/object_detection/core/preprocessor.py
// Next ID: 41
message PreprocessingStep {
  oneof preprocessing_step {
    NormalizeImage normalize_image = 1;
    RandomHorizontalFlip random_horizontal_flip = 2;
    RandomPixelValueScale random_pixel_value_scale = 3;
    RandomImageScale random_image_scale = 4;
    RandomRGBtoGray random_rgb_to_gray = 5;
    RandomAdjustBrightness random_adjust_brightness = 6;
    RandomAdjustContrast random_adjust_contrast = 7;
    RandomAdjustHue random_adjust_hue = 8;
    RandomAdjustSaturation random_adjust_saturation = 9;
    RandomDistortColor random_distort_color = 10;
    RandomJitterBoxes random_jitter_boxes = 11;

This file has been truncated. show original

My question is how one configures their augmentation pipeline when using the TFOD API. Consider this configuration file. It has a field for augmentation:

train_config: {
  ...
  data_augmentation_options {
    random_horizontal_flip {
    }
  }

If I wanted to expand the set of augmentation transformations here what should I do?

@Laurence_Moroney @khanhlvg any pointers?

Mark_Daoust · January 10, 2022, 6:15pm

Interesting.

This looks like they’ve re-encoded something like keras’s model.get_config, as a proto.

To change the data-augmentation, you edit that data_augmentation_options list.

The .proto files define what’s allowed. The definition of TrainConfig is here:

github.com

tensorflow/models/blob/aa3e639f80c2967504310b0f578f0f00063a8aff/research/object_detection/protos/train.proto#L25


      
          
          // Message for configuring DetectionModel training jobs (train.py).
          // Next id: 31
          message TrainConfig {
            // Effective batch size to use for training.
            // For TPU (or sync SGD jobs), the batch size per core (or GPU) is going to be
            // `batch_size` / number of cores (or `batch_size` / number of GPUs).
            optional uint32 batch_size = 1 [default=32];
          
            // Data augmentation options.
            repeated PreprocessingStep data_augmentation_options = 2;
          
            // Whether to synchronize replicas during training.
            optional bool sync_replicas = 3 [default=false];
          
            // How frequently to keep checkpoints.
            optional float keep_checkpoint_every_n_hours = 4 [default=10000.0];
          
            // Optimizer used to train the DetectionModel.
            optional Optimizer optimizer = 5;

data_augmentation_options is a repeated PreprocessingStep.

A PreprocessingStep is one of the items from that list. The parameters of each and their default values are defined in preprocessor.proto

If you want to add a RandomScale step:

train_config: {
  ...
  data_augmentation_options {
    random_horizontal_flip {
    }
   random_image_scale {
       min_scale_ratio: 0.9
       max_scale_ratio: 1.1
    }
  }
}

That format is “proto-text” (.PBTXT), you can check your syntax with:

from google.protobuf import text_format

train_config = TrainConfig()
train_config = text_format.Parse(
        r"""
        train_config: {
          ...
          data_augmentation_options {
            random_horizontal_flip {
            }
           random_image_scale {
               min_scale_ratio: 0.9
               max_scale_ratio: 1.1
            }
          }
        }
        """, train_config)
print(train_config)

Sayak_Paul · January 11, 2022, 1:33am

Wonderful. Thanks for the detailed explanation.

Teo_Blicard · September 22, 2022, 7:55am

Hello, how do you import TrainConfig() in order to do train_config = TrainConfig() please ?

Topic		Replies	Views
Apply customized augmentations using tensorflow object detection api TensorFlow datasets , model_garden , help_request	1	556	December 18, 2023
Data Augmentation in TensorFlow Object Detection API TensorFlow model_garden , help_request	1	702	April 18, 2023
Import Error for Update Config for Transfer Learning General Discussion model_garden , help_request	4	1883	November 28, 2022
How to multiple image augmentations which don't have layers? General Discussion keras , pytorch	3	964	December 23, 2022
Config_util import problem General Discussion install , help_request	1	716	June 14, 2023

Changing augmentation parameters in TFOD API

Related topics