Timestamp Generation (Forced Alignment) on 2.0-Pro-Exp

Rembrandt · February 17, 2025, 6:47pm

I have been trying to make timestamp generation (forced alignment) work with the Gemini models for some time, using audio files as input - the first model I am finding very consistently generating accurate responses is the latest 2.0-Pro experimental model (gemini-2.0-pro-exp-02-05) - other experimental models in the past would only work intermittently.

Requesting here that this capability be kept in the ultimate GA model - it is extremely useful in particular for certain non-English languages, where reliable forced alignment models do not exist. Would also mention that unfortunately, the 2.0-Flash model fails to generate reliable outputs for this task.

Joe1 · February 17, 2025, 10:24pm

I have been tracking this as well, and in my experience these models all produce accurate timestamps, while the base Flash 2.0 does not.

2.0 Flash Lite
2.0 Flash thinking
2.0 Pro

jkirstaetter · February 18, 2025, 9:05am

Hi there,

Does this apply to audio processing only or is your experience/mileage regarding videos similar?

Cheers.

Rembrandt · February 25, 2025, 6:20pm

Now the GA Flash Lite is breaking for timestamp generation, and the experimental model was working well. Quite frustrating.

If anyone from GOOGL is watching - would be great to understand if timestamps are supposed to be supported, and why it breaks on the new GA model.

Joe1 · March 3, 2025, 8:26pm

I have not tested it on video, so I don’t know.

Joe1 · March 3, 2025, 8:38pm

Confirmed on my end as well. The current Flash Lite timestamps are complete crap. Not even close to being accurate.

Topic		Replies	Views
Gemini Flash 2.0 audio transcription timestamps incorrect Gemini API audio	4	729	March 27, 2025
Timestamp generation (Forced Alignment) on 2.0 production models is still broken Gemini API models , audio	12	434	August 4, 2025
Gemini 2.0 flash lite timestamp hallucinations for audio but not video since going into GA Gemini API gemini-api , gemini-flash , gemini-20	3	243	July 11, 2025
Gemini Pro Timestamp Accuracy Issues in Audio Transcription Gemini API gemini-15 , api	9	736	March 27, 2025
Audio timestamp accuracy issue in Gemini 2.0 GA models Gemini API help_request , gemini-20	0	311	March 14, 2025

Timestamp Generation (Forced Alignment) on 2.0-Pro-Exp

Related topics