The Gemini API is unreliable in the morning (PST)

higi · March 28, 2026, 3:27pm

Hello,

I are building a consumer facing product and rely on several LLM calls. I really like the intelligence/latency trade-off of Gemini under good conditions. However, every day at around 9am PST, the latency of the models I use (gemini 3 flash and gemini 3.1 flash lite) more than triples, so I have to swap to another provider. The degradation lasts several hours. This has been ongoing for a while but has worsened in the last week.

Question for the Gemini API team: are you aware of this, and are planning to fix it? or is the non-vertex api meant to be used only for testing, non-production use cases, and the recommended route for those is vertex? I am also not 100% sure since I have also seen vertex api degrading at around the same timeframe (although by not as much).

Peter_Schroder · March 28, 2026, 6:49pm

The API is extremely unreliable, see this thread for example Handling 429 / 503 errors from the Gemini API - #16

Topic		Replies	Views
Gemini 3.0 flash latency spikes Gemini API models , gemini , gemini-3	0	208	February 11, 2026
Everyday Gemini API slowly degrades around 11.30 am IST, then fails completely Gemini API feedback , performance	8	504	March 5, 2026
Gemini API latency Issues Gemini API bug , api , issues	6	855	September 23, 2025
Extreme latency on gemini-1.5-flash API Gemini API api , models	3	754	January 6, 2025
Unexpected Delay in Gemini-1.5-Flash API Responses Gemini API gemini-15 , api	2	819	November 21, 2024

The Gemini API is unreliable in the morning (PST)

Related topics