How to implement a soft update for target networks in TF?

Unnamed · December 17, 2021, 1:55am

I am following a tutorial from a YouTuber where the soft update of the target networks is performed by this function:

    def update_network_parameters(self, tau=None):
        # Fall back to the agent's default soft-update rate.
        if tau is None:
            tau = self.tau

        # Actor: target <- tau * online + (1 - tau) * target
        weights = []
        targets = self.target_actor.weights
        for i, weight in enumerate(self.actor.weights):
            weights.append(weight * tau + targets[i] * (1 - tau))
        self.target_actor.set_weights(weights)

        # Critic: same update rule as the actor.
        weights = []
        targets = self.target_critic.weights
        for i, weight in enumerate(self.critic.weights):
            weights.append(weight * tau + targets[i] * (1 - tau))
        self.target_critic.set_weights(weights)

But I don't know if that's the proper way. What if there are NoisyDense layers or normalization layers? Are they also updated with this approach? Is there a proper implementation somewhere? Maybe it's already somewhere in the TF library and I just don't know about it…
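
For reference, here is a minimal sketch of the same update written against the variables directly, assuming two Keras models with identical architecture. In Keras, `model.weights` lists non-trainable weights as well as trainable ones, so the moving statistics of a BatchNormalization layer are blended too. Custom layers such as NoisyDense should be covered as long as they register their parameters as weights. The names `soft_update` and `make_model` are made up for illustration, and copying non-float weights verbatim is an assumption of this sketch, not an established convention.

```python
import tensorflow as tf


def soft_update(source, target, tau):
    """Blend every weight of `source` into `target` in place (Polyak averaging)."""
    for src, tgt in zip(source.weights, target.weights):
        if tf.as_dtype(tgt.dtype).is_floating:
            # target <- tau * source + (1 - tau) * target
            tgt.assign(tau * src + (1.0 - tau) * tgt)
        else:
            # Assumption: non-float weights (e.g. integer counters) cannot be
            # averaged meaningfully, so they are copied as-is.
            tgt.assign(src)


def make_model():
    # Hypothetical toy network containing a normalization layer.
    return tf.keras.Sequential([
        tf.keras.Input(shape=(3,)),
        tf.keras.layers.Dense(4),
        tf.keras.layers.BatchNormalization(),
    ])


online = make_model()
target_net = make_model()
soft_update(online, target_net, tau=1.0)    # hard copy at initialization
soft_update(online, target_net, tau=0.005)  # small step after each training update
```

Because `assign` writes into the existing variables, no intermediate list of weights has to be built and passed to `set_weights`.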

Dennis · December 17, 2021, 7:09am

Hi @Unnamed, welcome to the Forum.
Yes, please feel free to have a look at the Soft Actor-Critic agent, tf_agents.agents.SacAgent.
It has a target_update_tau parameter (among many others) that controls the soft update of the target network.
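
As a rough illustration of what such a tau factor does, the sketch below is plain Python rather than tf_agents code, and the function names are made up for this example. With the online weight held fixed, each soft update shrinks the gap between target and online weight by a factor of (1 - tau), so tau = 1 is a hard copy and a small tau makes the target trail slowly.

```python
def soft_update_scalar(online, target, tau):
    """One soft-update step: target <- tau * online + (1 - tau) * target."""
    return tau * online + (1.0 - tau) * target


def gap_after(steps, tau, online=1.0, target=0.0):
    """Distance between the target and a fixed online weight after `steps` updates."""
    for _ in range(steps):
        target = soft_update_scalar(online, target, tau)
    return abs(online - target)


if __name__ == "__main__":
    # Compare a hard copy, a fast-tracking target, and a slow-tracking target.
    for tau in (1.0, 0.1, 0.005):
        print(f"tau={tau}: gap after 100 updates = {gap_after(100, tau):.4f}")
```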

Also used in this colab tutorial:

Lucky Rewards,
Dennis
