A recommendation for anyone generating TTS: set a maxOutputToken limit. It seems that with the temperature at 0.3, the model generates audio for the first sentence and then goes silent until the output token limit is reached. Sadly, the silence is still charged. Going lower than 0.3, it seems the request still consumes your tokens, but you get no access to the output at all.
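For anyone wondering how to pick the limit, here is a rough sketch of one way to size it from the length of the input text. Everything in it is an assumption for illustration: the helper name `capped_tts_config`, the `maxOutputTokens` and `temperature` keys (check the exact field names for your API), the speaking rate, and especially `tokens_per_second`, which you need to look up in your provider's pricing docs.

```python
import math


def capped_tts_config(text, *, tokens_per_second, temperature,
                      words_per_minute=150.0, headroom=1.5):
    """Build a hypothetical generation config with an output-token cap.

    The cap is sized from a rough estimate of how long the text takes to
    speak, so a run that degenerates into silence is cut off (and billed)
    at roughly `headroom` times the expected audio length.
    """
    words = len(text.split())
    # Estimated speaking time in seconds at the assumed speaking rate.
    expected_seconds = words * 60.0 / words_per_minute
    # Convert to tokens and add headroom so normal output is not truncated.
    cap = max(1, math.ceil(expected_seconds * tokens_per_second * headroom))
    # Key names are assumptions; adapt them to your client library.
    return {"temperature": temperature, "maxOutputTokens": cap}


if __name__ == "__main__":
    # tokens_per_second=30 is a placeholder value, not a documented rate.
    config = capped_tts_config(
        "Hello there, this is a short test sentence.",
        tokens_per_second=30,
        temperature=1.0,
    )
    print(config)
```

The cap does not stop the silence from happening, it only bounds how much of it you pay for, so it is still worth checking the returned audio length against the estimate.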