The gradient in the update_step() function in Keras

dali_dali · January 19, 2023, 8:50pm

In tf.Keras , the function update_step() is used by an optimizer to update the variables at every iteration given the variable and the gradient. I want to know if this gradient is the full gradient of all the samples in the batch or the gradient for a sample in the batch?
thank you

chunduriv · January 23, 2023, 2:57am

@dali_dali,

When using the model.fit() or the model.train_on_batch() the optimizer’s update_step() is called after the forward pass and the computation of the loss for a batch of data.

The gradients passed to the update_step() function are the full gradients of all the samples in the batch (i.e batch gradient).

Thank you!

Topic		Replies	Views
Code error using Gradient Tape General Discussion help_request , tensorflow	2	1776	July 13, 2022
Train_on_batch and train_step used in custom training loop giving different results Keras models , keras , help_request	1	1235	October 15, 2024
tf.IndexedSlices gradients and dense gradients in keras General Discussion api , keras	2	629	March 10, 2023
Getting gradient as a sequential model General Discussion api , keras , gradienttape , help_request	1	1694	March 7, 2023
Parameter not updated with gradients at the end TensorFlow help_dev	5	5340	June 30, 2021

The gradient in the update_step() function in Keras

Related topics