Experiencing Extremely High Latency with Gemini 2.5 Flash & Pro

codeonym · November 12, 2025, 1:19pm

I’ve been testing Gemini 2.5 Flash and Pro models in my LangGraph agents, and the experience has been terrible. For example, refining an 8k-character HTML table can take up to 400 seconds. The Flash-Lite model is much faster but produces much worse output.

Has anyone else seen these massive latency spikes with Gemini 2.5 Flash or Pro

Pannaga_J · November 26, 2025, 8:34am

Hi @codeonym
Is this issue still occurring? If so, please provide a comparison of the time it took previously versus the time it takes now. Can you also please let us know if this delay is affecting any other tasks.
Also, please test and compare performance with the gemini-3-pro-preview model to see if any improvement is observed.
Thanks

Topic		Replies	Views
Extreme latency on gemini-1.5-flash API Gemini API api , models	3	791	January 6, 2025
Persistent High Latency with `gemini-2.5-pro` Gemini API generative-ai , gemini-2-5	4	1282	July 26, 2025
Increased Latency in the Gemini 2.5 Flash API Gemini API gemini , gemini-flash	1	342	December 23, 2025
Gemini-2.5-pro accessed over https://generativelanguage.googleapis.com/v1beta/openai/ has dramatic latency increase Gemini API api , model , gemini-2-5	10	1181	July 21, 2025
Increased Latency in Gemini 3 pro and Flash Gemini API api , gemini-3	2	405	February 19, 2026

Experiencing Extremely High Latency with Gemini 2.5 Flash & Pro

Related topics