How to define BatchNorm3d in TensorFlow

PyTorch has three types of BatchNorm layers: BatchNorm1d, BatchNorm2d and BatchNorm3d. TensorFlow has only one BatchNormalization layer.

Are these layers equivalent? I mean, can the three PyTorch layers be defined using that layer from TensorFlow? Are there any parameters of TensorFlow layer that need to be adjusted to have it achieve the behavior of PyTorch layers?

Hi @Nada-Nada, in PyTorch, BatchNorm1d applies batch normalization over a 2D or 3D input, BatchNorm2d over a 4D input, and BatchNorm3d over a 5D input. In TensorFlow, the single BatchNormalization layer accepts inputs of any of these dimensionalities. Thank You.

Many thanks @Kiran_Sai_Ramineni
One question please: do I need to set the axis parameter of TensorFlow depending on the dimensionality of the input data, or can I still set it to -1 (the default value) for the different input dimensionalities? (I am just trying to replicate PyTorch behavior.)

Hi @Nada-Nada, yes, the value of the axis parameter depends on which dimension of the input the normalization should be applied over. Thank You.
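As a sketch of what this means in practice (my own example): PyTorch's BatchNormXd layers always normalize over the channel axis, which is position 1 in PyTorch's channels-first layout (N, C, ...), while TensorFlow's default axis=-1 assumes a channels-last layout (N, ..., C). So on channels-first data you would pass axis=1:

```python
import numpy as np
import tensorflow as tf

x = np.random.randn(4, 8, 16, 16).astype("float32")  # (N, C, H, W), channels-first

# axis=1 marks position 1 as the channel dimension, matching the axis
# that PyTorch's BatchNorm2d normalizes over.
bn = tf.keras.layers.BatchNormalization(axis=1)
y = bn(x, training=True)

# After normalization, the per-channel mean over (N, H, W) is ~0.
print(np.abs(y.numpy().mean(axis=(0, 2, 3))).max())
```

Alternatively, you can keep axis=-1 and transpose the data into channels-last order before the layer.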

Thank you @Kiran_Sai_Ramineni
Can you please explain how to set this parameter to replicate the behavior of:
1- BatchNorm1d layer of PyTorch
2- BatchNorm2d layer of PyTorch
3- BatchNorm3d layer of PyTorch

Your help is very much appreciated.

Hi @Nada-Nada, Please refer to this gist for implementation of BatchNormalization on data having different dimensions using torch and Tensorflow. Thank You.

Many thanks @Kiran_Sai_Ramineni for the gist. I can see that the output of TensorFlow BatchNormalization (axis=-1) is different from the one produced by PyTorch's BatchNormXd. Do you have an idea please how to set the axis parameter so that they produce the same normalization output?

Again, thank you very much for your help.

Hi @Nada-Nada, if you look at the momentum argument (the momentum for the moving average), its default value in PyTorch is 0.1, while in TensorFlow it is set to 0.99.

Also, the formula used to update the running statistics is different between PyTorch and TensorFlow, both during training and inference.

Mathematically, the PyTorch update rule for the running statistics is x̂_new = (1 − momentum) × x̂ + momentum × x_t, where x̂ is the estimated statistic and x_t is the new observed value.

In TensorFlow it is moving_mean = moving_mean * momentum + mean(batch) * (1 - momentum).
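The two rules are mirror images of each other: substituting momentum_tf = 1 − momentum_pt makes them identical, which is why PyTorch's default 0.1 corresponds to a TensorFlow momentum of 0.9. A quick numeric check (plain Python, illustrative values of my choosing):

```python
momentum_pt = 0.1              # PyTorch default
momentum_tf = 1 - momentum_pt  # 0.9, the TensorFlow equivalent

running, batch_stat = 5.0, 11.0  # arbitrary running statistic and new batch value

# PyTorch rule: x_new = (1 - momentum) * x_hat + momentum * x_t
new_pt = (1 - momentum_pt) * running + momentum_pt * batch_stat
# TensorFlow rule: moving = moving * momentum + batch * (1 - momentum)
new_tf = running * momentum_tf + batch_stat * (1 - momentum_tf)

print(new_pt, new_tf)  # identical: 5.6 5.6
```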

That might be the reason for the different results. Please refer to this similar issue to know more. Thank You.