Is there a way to get confidence score for each word?

coheestore · October 11, 2024, 4:01pm

Hi, in this document Audio understanding (speech only) | Generative AI on Vertex AI | Google Cloud, it said that " * Audio-only timestamps: To accurately generate timestamps for audio-only files, you must configure the audio_timestamp parameter in generation_config.

Is there a way to also get confidence or accuracy score for each word?

Topic		Replies	Views
Audio timestamp accuracy issue in Gemini 2.0 GA models Gemini API help_request , gemini-20	0	221	March 14, 2025
Call to update documentation for Audio Understanding (Refer to timestamps) Gemini API audio , gemini-20 , documentation	1	48	May 31, 2025
Transcribing calls with Gemini - labelling speakers wrong Gemini API gemini	3	189	October 25, 2024
How to get consistent Multi-Speaker Transcription output from Gemini 2.5 Pro? Gemini API api , audio , gemini-25	1	52	June 9, 2025
Gemini 2.0 flash lite timestamp hallucinations for audio but not video since going into GA Gemini API gemini-api , gemini-flash , gemini-20	2	148	May 29, 2025