Hi, in this document Audio understanding (speech only) | Generative AI on Vertex AI | Google Cloud, it said that " * Audio-only timestamps: To accurately generate timestamps for audio-only files, you must configure the audio_timestamp
parameter in generation_config
.
Is there a way to also get confidence or accuracy score for each word?