Tf-agent metrics

dka · May 24, 2022, 3:19am

Hello, I am pretty new to TF-Agents and feel confused about the used metrics in the replay_buffer, dynamic driver and the agent training.
I would really appreatiate if someone can give me a brief explanation about the following terms:

I have created 4 environments and put them all together in a BatchedPyEnvironment and then converted it to a TFPyEnvironment. so the batch_size of this environment is 4.
Then I created a TFUniformReplayBuffer , so what are the 1.batch_zise and 2.max_length ? I understand the batch size is how many elements is stored in the batch, but when I change the max_length value , nothing really happens, unless it is 1 it gives an error.
my observation is a (1,25) vector of integers.
Then to read the replay buffer, I create a dataset outside the training loop ,through replay_buffer.as_dataset , should this dataset be created at each training iteration? , also what is the difference between the batch_size in the TFUniformReplayBuffer and the 3.sample_batch_size ?
when I change the num_steps from 2 to 1, it also give an error, so what does this 4.num_steps mean?
then I create an iterator = iter(dataset) also outside the training loop.
Also when I see the 5.number of episodes, and 6.number of steps by
env_steps.result().numpy() and num_episodes.result().numpy() after the training loop, it shows different numbers that I can’t control every time I run the training.
I create a step driver to collect experience inside the loop dynamic_step_driver.DynamicStepDriver , but I’m also not sure I got what is the 7.num_steps in it really representing, the training loop I can control the 8.num_iterations , but I though this will be the same number of episodes I get from num_episodes.result().numpy() and the number of steps I get from env_steps.result().numpy() , would be the same as num_steps in dynamic_step_driver.DynamicStepDriver , but they are all different.
Any hints will be very helpful, Thanks!

Little_Dave · May 27, 2022, 7:13am

Check this out, this might help : Module: tf_agents.metrics.tf_metric | TensorFlow Agents

Topic		Replies	Views
TF-agents mismatched trajectory spec General Discussion help_request , tf-agents	0	641	March 22, 2022
Why should I set 'steps_per_epoch' manually? General Discussion datasets	5	2165	September 2, 2023
Tf_agents episodic replay buffers General Discussion tf_agents , help_request	0	963	October 25, 2021
PPO Problem with Tensorflow General Discussion help_request	0	303	September 4, 2023
Time_step doesn't match 'time_step_spec' in a custom py_environment General Discussion datasets , timeseries , array	9	585	December 1, 2023

Tf-agent metrics

Related topics