Is the SAC agent in tensorflow up to date?

Janis_Taranda · November 16, 2021, 2:40am

I found this:

…there is a new version of the algorithm that uses only a Q function and disposes of the V function. It also adds automatic discovery of the weight of the entropy term called the ‘temperature’

Looking at the current code, I do not see this temperature parameter. Or is it just named differently?

Reference:

Sergio_Guadarrama · November 16, 2021, 10:54pm

Hi Janis, those are two different algorithms, CQL-SAC and SAC.

CqlSacAgent implements the CQL algorithm for continuous control domains from “Conservative Q-Learning for Offline Reinforcement Learning” (Kumar et al., 2020).

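The SACAgent does include the temperature: it takes a target_entropy and learns the entropy weight through the automatic entropy adjustment described in the paper, so the parameter is there, just not under the name “temperature”.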
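
To make the role of target_entropy concrete, here is a minimal sketch of the automatic entropy (temperature) adjustment in plain Python. It is not the TF-Agents implementation. It assumes the temperature is stored as log_alpha, that the loss is -log_alpha * mean(log_pi + target_entropy), and that the target is set with the -dim(A) heuristic. The function names, learning rate, and sample log-probabilities are made up for illustration.

```python
def alpha_loss_grad(log_probs, target_entropy):
    """Gradient w.r.t. log_alpha of the assumed loss
    -log_alpha * mean(log_pi + target_entropy)."""
    return -sum(lp + target_entropy for lp in log_probs) / len(log_probs)


def update_log_alpha(log_alpha, log_probs, target_entropy, lr=0.1):
    """One plain gradient-descent step on log_alpha (hypothetical helper)."""
    return log_alpha - lr * alpha_loss_grad(log_probs, target_entropy)


action_dim = 2
target_entropy = -float(action_dim)  # assumed heuristic: -dim(A)
log_alpha = 0.0                      # i.e. alpha = exp(0) = 1

# Policy is too deterministic: sampled entropy (-log_pi) is below the target,
# so the temperature should go up to reward more exploration.
low_entropy_log_probs = [3.0, 3.2, 2.8]
raised = update_log_alpha(log_alpha, low_entropy_log_probs, target_entropy)

# Policy is more random than the target: the temperature should go down.
high_entropy_log_probs = [1.0, 0.8, 1.2]
lowered = update_log_alpha(log_alpha, high_entropy_log_probs, target_entropy)
```

In this sketch the temperature is never set by hand; only target_entropy is chosen, and log_alpha drifts until the policy's entropy roughly matches it. That is why a search for a parameter literally named "temperature" can come up empty.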
