Gemini 2.5 Flash audio response latency doubled / tripled after 3.0 Pro release?

Richard_Wong · December 2, 2025, 7:09pm

Anyone experiencing this?

Audio (non-streaming) use case. No code change on our side and config has thinking budget set to 0. Not sure what’s the exact date for the regressions, but starting in late November, right after 3.0 Pro release.

Can Google folks take a look? Thanks!

icapora · December 3, 2025, 1:30am

Hey @Richard_Wong,
yes, we are experiencing the same behavior. When sending all the audio at end_of_speech, the response shows a noticeable delay.

In this post I shared a small workshop where I explain the issues, pros, and cons of using audio streaming vs. end_of_speech, in case it helps anyone as an example or reference while debugging this behavior:

Hope it’s useful!

sonken625 · December 3, 2025, 2:49am

I’m experiencing exactly the same issue. It works perfectly with text input, but only audio inputs are affected.

sonken625 · December 3, 2025, 2:52am

The instability is so severe that I’m seriously considering switching to OpenAI’s API instead.

Srikanta_K_N · December 17, 2025, 10:21am

Hi @Richard_Wong, apologies for the delayed response.

My understanding is that you are trying to get an audio response for your query using the 2.5-flash model and facing latency issues in that.

To understand the issue better, could you please elaborate on what you are trying to achieve and possibly share a snippet of code, so that we can take a look?

Thank you!

Topic		Replies	Views
Latency problems API gemini 2.0 flash multimodal life Gemini API api , audio , gemini-flash , gemini-20	2	156	March 25, 2025
Increased Latency in the Gemini 2.5 Flash API Gemini API gemini , gemini-flash	1	74	December 23, 2025
Gemini Live API models high Latency Gemini API api , models , gemini	11	418	December 11, 2025
Inconsistent Response Behavior in gemini-2.5-flash-native-audio-preview-09-2025 Voicebot Gemini API ai-studio , live-streaming	4	381	December 7, 2025
Extreme latency on gemini-1.5-flash API Gemini API api , models	3	688	January 6, 2025

Gemini 2.5 Flash audio response latency doubled / tripled after 3.0 Pro release?

Related topics