Plotting gradient computation graph

Cola_Lightyear · March 18, 2025, 2:38pm

Hi, to debug the origin of NaN values in gradients I’d like to plot the graph of the gradient computation.

Is this possible in tensorflow? I looked around a lot and did not find a working solution.

Cola_Lightyear · March 19, 2025, 11:56am

The question is obsolete - I was able to solve my NaN problem. For the sake of future readers: I used tf.norm somewhere, where I had forgotten it is used (Vector Quantizer). tf.norm is numerically unstable for small values. You have to modify it according to:

def safe_norm(x, epsilon=1e-12, axis=None, keepdims=False):
    return tf.sqrt(tf.reduce_sum(x ** 2, axis=axis, keepdims=keepdims) + epsilon)

I found this variant on github. Approach the issue similarly for anything using tf.sqrt.

Topic		Replies	Views
Gradient on matrix inverse General Discussion tfdata , gradienttape	2	308	February 6, 2025
Nan loss occurring when training transformer model for machine translation General Discussion machine-learning , model , training	1	173	January 9, 2025
Tf.gradients() vs tf.gradientTape.gradient() in graph mode General Discussion education , help_request , tfcore	1	1250	September 22, 2023
Accuracy issue to compute the gradients General Discussion help_request , tfcore	1	623	July 7, 2021
Backward computational graph General Discussion keras , help_request , tfcore	4	915	September 15, 2021

Plotting gradient computation graph

Related topics