Calculating gradient from model with multiple outputs

I have a model with 16 outputs, and I want to calculate the derivative of each output with respect to time. Time is one of the 18 inputs to the model. I tried the code below:

import numpy as np
import tensorflow as tf

def ddt(model, x, t):
    u_t = np.zeros([50, 16])
    with tf.GradientTape(persistent=True) as tape:
        tape.watch(t)  # t is an ordinary tensor, so it has to be watched explicitly
        u_pred = model(tf.concat([x, t], axis=1))
        # take the per-output slices inside the tape so the slice ops are recorded
        cols = [u_pred[:, i:i+1] for i in range(16)]
    for i in range(16):
        # each gradient has shape (50, 1); drop the trailing axis before storing it
        u_t[:, i] = tape.gradient(cols[i], t)[:, 0]
    del tape
    return u_pred, u_t

u_pred has shape (50, 16). However, u_t comes back as all NaN.
tape.gradient(u_pred, t), on the other hand, does return a (50, 1) tensor of values.
Any suggestions?
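
If I understand the docs correctly, when the target passed to tape.gradient is non-scalar, the gradients of its elements are summed, so the (50, 1) result above is the derivative of the sum of all 16 outputs rather than 16 separate derivatives. A toy sketch of what I mean (two outputs standing in for my model):

import tensorflow as tf

# toy stand-in for the model: two outputs, t**2 and 3*t
t = tf.constant([[1.0], [2.0]])              # (2, 1) batch of time values
with tf.GradientTape() as tape:
    tape.watch(t)
    u = tf.concat([t**2, 3.0 * t], axis=1)   # (2, 2)

# non-scalar target: the gradients of all elements are summed, so this is
# d(t**2 + 3t)/dt = 2t + 3 per sample, not two separate derivatives
print(tape.gradient(u, t))                   # [[5.], [7.]]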

I am using TensorFlow version 2.10.0.

It looks like I need to use tape.batch_jacobian.
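Roughly what I have in mind (a sketch; I'm assuming x is (50, 17) and t is (50, 1) to match my batch, and that a single call on a non-persistent tape is enough):

import tensorflow as tf

def ddt(model, x, t):
    with tf.GradientTape() as tape:
        tape.watch(t)
        u_pred = model(tf.concat([x, t], axis=1))    # (50, 16)
    # batch_jacobian gives (50, 16, 1): for each sample, the derivative of each
    # of the 16 outputs with respect to that sample's single time input
    u_t = tape.batch_jacobian(u_pred, t)[:, :, 0]    # (50, 16)
    return u_pred, u_t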
When I compare the batch_jacobian output to np.gradient(u_pred, dt), they do differ. Is it correct to assume that batch_jacobian is closer to the true derivative, since np.gradient depends on the finite dt time step? For example, if dt got smaller and smaller, would np.gradient converge to the batch_jacobian result?
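
Here is the kind of check I was thinking of (a toy sketch with sin(t) standing in for the model, so the exact derivative is known, and a uniformly spaced time grid in place of my 50 samples; np.gradient only makes sense here if consecutive rows really are dt apart with all other inputs held fixed):

import numpy as np
import tensorflow as tf

def max_gap(dt):
    t = tf.reshape(tf.range(0.0, 1.0, dt), (-1, 1))   # uniformly spaced times, step dt
    with tf.GradientTape() as tape:
        tape.watch(t)
        u = tf.sin(t)                                  # toy "model"; exact derivative is cos(t)
    autodiff = tape.batch_jacobian(u, t)[:, :, 0]      # derivative from the tape
    finite_diff = np.gradient(u.numpy(), dt, axis=0)   # finite differences along the time axis
    return np.max(np.abs(finite_diff - autodiff.numpy()))

for dt in [0.1, 0.01, 0.001]:
    print(dt, max_gap(dt))   # the gap shrinks as dt gets smaller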