Gemini 3.1 Flash Lite – frequent 503s and high latency recently?

Ethan_W · April 8, 2026, 4:05am

Been using Gemini 3.1 Flash Lite in a small project and over the past week or so it’s become pretty unreliable. Getting a lot of 503s:

And when requests do succeed, latency is noticeably worse than before — simple prompts that used to return in under a second are now taking 5–10s.

I’m well within rate limits so it’s not a quota thing. Retries help sometimes but not always. Is this a known issue on the backend? Anyone else seeing this?

mysubcult · April 8, 2026, 7:25am

The same problem, sadly, I don’t know what to do about it.

ernestknurov · April 9, 2026, 1:00pm

I also got similar issue I used to check gemini 3.1 flash lite preview speed like month ago and it was comparable with gemini 2.5 flash lite, around 1.5s for my simple task. Now it’s 3s for gemini 3.1 flash lite preview

rryyqn · April 14, 2026, 10:46am

Gemini hasn’t yet upscaled the resource allocation for this model. It’s still in preview so all we can do is hope the team can work on this. I’ve been getting 503’s consistently for maybe 20 hours, specifically with 3.1 flash lite model.

rupam_Rakshit · April 14, 2026, 12:29pm

same problam I’ve been getting 503’s consistently for maybe 10 hours

Topic		Replies	Views
Gemini 3.1 Pro Preview API requests failing/timing-out Gemini API api , gemini , gemini-3	13	2065	March 5, 2026
503 errors with gemini 2.5 pro Gemini API api , gemini	1	417	August 27, 2025
Many 503s from Gemeni-2.5 Flash and Pro Gemini API gemini-flash-2-5	3	148	January 14, 2026
Persistent 503 errors with Gemini 3.1 Pro Preview Gemini API gemini-3	1	58	May 16, 2026
Frequent 503 Errors (Service Unavailable) across all models Gemini API api , gemini	122	12792	June 16, 2026

Gemini 3.1 Flash Lite – frequent 503s and high latency recently?

Related topics