Retrain TF-Agents model with changed parameters (Checkpointer or PolicySaver)

Sarah_Riedmann · May 26, 2021, 3:30pm

Hello everyone!

I am trying to load a deep q model and retrain it with different parameters (e.g. changed epsilon value). I am using a config file for setting various parameters of the environment and the agent.
What is the best way to go about this? Should i use Checkpointer, PolicySaver or something else entirely?

I would be very grateful for your help!
Thank you!

8bitmp3 · May 26, 2021, 3:45pm

I’d have to experiment with this too. Since Checkpointer allows you to save and load not just the policy network state, but also the training state, I’d give that a go. In addition, it seems like it also caches into a replay buffer, which would be useful for sampling in DQN’s case. (tf_agents.utils.common.Checkpointer | TensorFlow Agents)

Hope this helps a little. cc @yablak @markdaoust

8bitmp3 · May 27, 2021, 6:26pm

@Sarah_Riedmann, spoke with tensorflow-agents team member and they advised PolicySaver - it’s TF-Agents specific for saving a policy (with the step, and other info etc). PolicySaver uses Checkpointer underneath. Hope this helps!

Sarah_Riedmann · May 31, 2021, 11:00am

Thank you @8bitmp3! I will experiment with PolicySaver.

Sarah_Riedmann · May 31, 2021, 12:36pm

I have tried using PolicySaver and it works so far. However, I would like to set the loaded policy as the DQN agent’s policy. Is it possible to continue using the agent with the saved policy or do i have to use loaded_policy.action() manually?

Any help would be appreciated! Thank you.

Topic		Replies	Views
Connection between Agents and Policies General Discussion docs , help_request	11	1142	June 3, 2021
How to load a TensorFlow model to retrain it without the optimizer states being reset? General Discussion models , help_request	3	859	October 27, 2023
PPO Problem with Tensorflow General Discussion help_request	0	306	September 4, 2023
Correct way to retrain a Keras model Keras models , keras	9	2503	October 23, 2023
Loading Keras Model with Custom Optimizer Returns None: Seeking Solutions General Discussion models , keras	1	451	October 6, 2023

Retrain TF-Agents model with changed parameters (Checkpointer or PolicySaver)

Related topics