Hi everyone,
I am just now learning how to use Keras, so this is likely a very newbie question. I am going through the tutorials and am currently on ‘Load and preprocess data’ > ‘CSV’.
Below is the relevant code:
#Imports used below
import numpy as np
import pandas as pd
import tensorflow as tf
from tensorflow.keras import layers

#Loading data and splitting features and labels
titanic = pd.read_csv("https://storage.googleapis.com/tf-datasets/titanic/train.csv")
titanic_features = titanic.copy()
titanic_labels = titanic_features.pop('survived')

#Building a dict of symbolic Input tensors, one per feature column
inputs = {}
for name, column in titanic_features.items():
    dtype = column.dtype
    if dtype == object:
        dtype = tf.string
    else:
        dtype = tf.float32
    inputs[name] = tf.keras.Input(shape=(1,), name=name, dtype=dtype)

#Separating the numeric inputs, as we need to normalize them
numeric_inputs = {name: input for name, input in inputs.items()
                  if input.dtype == tf.float32}

#Concatenate the numeric columns
x = layers.Concatenate()(list(numeric_inputs.values()))

#Normalization layer
norm = layers.Normalization()

#Adapt the normalization layer on the raw numeric data
norm.adapt(np.array(titanic[numeric_inputs.keys()]))
all_numeric_inputs = norm(x)
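For context, when I inspect the two objects involved myself (this check is mine, not part of the tutorial), they seem to be completely different kinds of things:

#My own check, not from the tutorial: x is symbolic, the adapt argument is concrete data
print(type(x))                                         # symbolic KerasTensor produced by the Concatenate layer
print(type(np.array(titanic[numeric_inputs.keys()])))  # plain numpy.ndarray holding the raw numeric columns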
So the question is: why do we adapt the normalization layer on titanic[numeric_inputs.keys()], which is the raw data with just the numeric columns selected? Why wouldn’t we adapt it on ‘x’, which is the concatenation of the numeric inputs? Also, why are we adapting on data that has been converted to a NumPy array? Why not adapt on the symbolic tensors, since their whole point is supposed to be keeping track of the operations performed on them? I am very confused about this, and I would appreciate it if someone could explain, or point me to something that helps me understand how these layers and symbolic tensors are supposed to be wired together, i.e. what should call what and in what order, so to say.
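To make the confusion concrete, this is roughly what I expected to be able to write instead (just my mental model, it may well not be valid Keras at all):

#What I expected (possibly wrong): adapt directly on the symbolic concatenated tensor
norm = layers.Normalization()
norm.adapt(x)                # x is the symbolic output of Concatenate, not the raw data
all_numeric_inputs = norm(x)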
Thanks for the help,
Dominik