Consider the following model:
import tensorflow as tf
from tensorflow import keras

class MyModel(keras.Model):
    def __init__(self, **kwargs):
        super(MyModel, self).__init__(**kwargs)
        self.square_layer = keras.layers.Dense(2)
        self.cube_layer = keras.layers.Dense(2)
        self.optimizer = tf.keras.optimizers.Adam()

    @tf.function
    def call(self, X):
        return tf.stack([self.square_layer(X), self.cube_layer(X)], axis=-1)

    @tf.function
    def train_step(self, inputs, targets):
        with tf.GradientTape() as tape:
            predictions = self(inputs)
            loss = tf.reduce_mean(tf.square(predictions - targets))
        grads = tape.gradient(loss, self.trainable_weights)
        self.optimizer.apply_gradients(zip(grads, self.trainable_weights))
        return loss
If we train using the following ‘train’ method and set ‘self.cube_layer.trainable’ to either True or False, the result is as expected in both cases:
def train(self, inputs, targets, num_epochs=5000):
    self.cube_layer.trainable = False  # True or False
    self.compile(optimizer=self.optimizer)
    for epoch in range(num_epochs):
        loss = self.train_step(inputs, targets)
    print("Loss: " + str(loss))

inputs = tf.constant([[1, 2]], dtype=tf.float32)
targets = tf.constant([[[3, 6], [9, 12]]], dtype=tf.float32)
model = MyModel()
model.train(inputs, targets)
print(model(inputs))
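As a sanity check, the flag itself behaves correctly outside a traced function: once the layers are built, flipping ‘trainable’ changes what ‘model.trainable_weights’ reports. A minimal sketch, reusing ‘inputs’ from above:

model = MyModel()
model(inputs)  # build both Dense layers so they have weights
model.cube_layer.trainable = False
# Only square_layer's kernel and bias should remain trainable:
print([w.name for w in model.trainable_weights])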
But if we change the ‘trainable’ flag during training, the result is not as expected:
def train(self, inputs, targets, num_epochs=5000):
    self.cube_layer.trainable = False
    self.compile(optimizer=self.optimizer)
    for epoch in range(num_epochs):
        loss = self.train_step(inputs, targets)

    self.cube_layer.trainable = True
    self.compile(optimizer=self.optimizer)
    for epoch in range(num_epochs):
        loss = self.train_step(inputs, targets)
    print("Loss: " + str(loss))

inputs = tf.constant([[1, 2]], dtype=tf.float32)
targets = tf.constant([[[3, 6], [9, 12]]], dtype=tf.float32)
model = MyModel()
model.train(inputs, targets)
print(model(inputs))
In the above example, if we remove the ‘@tf.function’ decorators from ‘call’ and ‘train_step’, the result is as expected! So I believe this has something to do with ‘tf.function’ and TensorFlow graph compilation. Is there a way to use ‘tf.function’ and still set the ‘trainable’ attribute dynamically during training? I am using TensorFlow 2.9.1.
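For reference, here is a minimal sketch (independent of the model above) of what I suspect is happening: ‘tf.function’ traces the Python body once per input signature and caches the resulting graph, so a plain Python value read during tracing is baked into the graph, and later changes to it are ignored:

flag = True

@tf.function
def f(x):
    # `flag` is a plain Python bool, so this branch is resolved once, at
    # trace time; the cached graph contains only one of the two paths.
    if flag:
        return x + 1.0
    return x - 1.0

print(f(tf.constant(1.0)))  # tf.Tensor(2.0, shape=(), dtype=float32)
flag = False
print(f(tf.constant(1.0)))  # unchanged: same input signature, no retrace

If ‘self.trainable_weights’ is captured at trace time in the same way, that would explain why flipping ‘cube_layer.trainable’ mid-training has no effect on the already-compiled ‘train_step’.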