Tier 3 Project – Persistent 503 & 429 Errors in Production (No Communication / Need ETA)

lhm · May 6, 2026, 3:36am

Hi team,

We are currently experiencing ongoing issues with the Gemini API on a Tier 3 project (production environment), and we’re looking for clarification and guidance.

Issue Summary

For the past several days, we’ve been seeing recurring:

503 (Service Unavailable)
429 (Too Many Requests)

These errors are happening consistently and are significantly impacting production usage.

Observations

Errors started a few days ago and have been persisting without resolution
No official communication or incident report found so far
The issue appears intermittent but frequent enough to disrupt service
Error spikes are clearly visible in usage dashboards (attached screenshots)
Occurring even when traffic patterns remain relatively stable

Impact

Production degradation
Failed requests at scale
Unreliable API behavior despite being within expected usage patterns

Questions

Is there an ongoing incident or degradation affecting Gemini API (Tier 3)?
Are these 429s expected (rate limiting changes?) or unintended?
Are the 503 errors related to backend instability or capacity issues?
Is there an ETA for resolution?
Any recommended mitigation strategies on our side?

Additional Context

Tier: 3
Timeframe: last 7 days (also visible in the 28-day trend)
Models used: Gemini 2.5 Flash / Flash Lite

Happy to provide more logs or request IDs if needed.

This is a critical production issue, so any visibility would be greatly appreciated.

Thanks in advance.

lhm · May 6, 2026, 3:40pm

Error we keep experiencing:

GoogleGenerativeAIFetchError: [GoogleGenerativeAI Error]: Error fetching from https://generativelanguage.googleapis.com/v1beta/models/gemini-2.5-flash-lite:generateContent: [503 Service Unavailable] This model is currently experiencing high demand. Spikes in demand are usually temporary. Please try again later.

Can anyone from the Google Team help with this?

faisst · May 6, 2026, 7:30pm

We’ve been facing these errors for over a month now (approx. 30% of our requests fail and we have to retry another 3 or 4 times before it works), with no answer whatsoever so far. I’ve probably replied to more than 10 posts and they never replied to me.

Besides that, if you check the Gemini models on OpenRouter, they all show very concerning uptime, with no status update on Google’s official API status page (for either Google AI Studio or Vertex).

lhm · May 7, 2026, 3:45pm

Thanks for the reply.
The issue is still ongoing on our side and is affecting a production Tier 3 project with recurring 503 and 429 errors over several consecutive days.

This does not appear to be isolated transient throttling, as the error spikes are significant and visible directly in the API dashboard despite relatively stable traffic patterns.

Could someone from the Gemini/API infrastructure team please confirm:

whether there is an active backend degradation,
if rate limiting behavior has recently changed,
and whether there is any ETA or mitigation guidance for production customers?

faisst · May 8, 2026, 3:39pm

It’s clearly just compute constraints. They don’t have enough GPUs, electricity, etc. to keep a steady uptime for all customers (and their TPUs probably have a lot of restrictions too).

The problem is the lack of transparency: acting like everything is normal, showing wrong uptimes on their official API status page, etc.

lhm · May 12, 2026, 3:14pm

To Google/Gemini team:

We still have no clear answer or visibility regarding the ongoing Gemini API instability affecting production Tier 3 projects.

For several weeks now, we have been experiencing persistent 503 and 429 errors across Gemini 2.5 Flash / Flash Lite in a real production environment. The impact is significant and ongoing.

At this stage, what is most concerning is not only the instability itself, but the complete lack of communication and transparency around it.

We are paying customers running production workloads on your infrastructure. We need:

Clear acknowledgement of the issue
Visibility on root cause
Realistic ETA for resolution
Proper status communication when degradation occurs

If the issue is related to backend capacity or insufficient computing resources, this needs to be addressed internally and communicated professionally to the developers who depend on the platform.

Right now, the API behavior is unreliable despite stable traffic patterns and normal usage behavior on our side.

We are happy to provide logs, request IDs, and additional technical details if needed, but we need actual answers and visibility from the Gemini team.

lhm · May 15, 2026, 5:20pm

@Logan_Kilpatrick How is this issue still not being officially addressed by Google?

It is very concerning and raises real questions about the future reliability of Gemini for production projects.

lhm · May 18, 2026, 5:35pm

It is beyond ridiculous to have such a poor paid service and no customer support.

Jon_Matthews · May 19, 2026, 12:29pm

Hi lhm,

Apologies for the delayed response. I wrote a post about these issues.

The 503s are indeed at a platform level. The best advice we can offer is:

If possible, move some traffic to the Batch API.
Use exponential backoff and retry
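
As a rough illustration of the backoff-and-retry suggestion, here is a minimal sketch, not an official client. `ApiError` and `call_with_backoff` are hypothetical names, and the sketch assumes your own code maps whatever error your SDK raises to an HTTP status. It retries only 429 and 503, doubles the delay ceiling on each attempt, and sleeps a random fraction of that ceiling (full jitter) so that many clients do not retry in lockstep.

```python
import random
import time

# Statuses treated as transient in this sketch (the two discussed in this thread).
RETRYABLE_STATUS = {429, 503}


class ApiError(Exception):
    """Hypothetical error type carrying the HTTP status of a failed call."""

    def __init__(self, status):
        super().__init__(f"HTTP {status}")
        self.status = status


def call_with_backoff(fn, max_attempts=5, base_delay=1.0, max_delay=16.0,
                      sleep=time.sleep, rng=random.random):
    """Call fn(), retrying on 429/503 with exponential backoff and full jitter."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except ApiError as err:
            last_attempt = attempt == max_attempts - 1
            # Give up on non-transient errors or once the attempts are used up.
            if err.status not in RETRYABLE_STATUS or last_attempt:
                raise
            # Delay ceiling doubles each attempt (base, 2x, 4x, ...), capped.
            ceiling = min(max_delay, base_delay * 2 ** attempt)
            # Full jitter: sleep a random fraction of the ceiling.
            sleep(rng() * ceiling)
```

With these default values the worst-case total sleep is 15 seconds (1 + 2 + 4 + 8), so the parameters can be tuned to fit a per-request latency budget. The `sleep` and `rng` arguments exist only so the behavior can be tested without real waiting.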

For the 429s, I’d suggest checking all the charts on your AIS rate limits page. A common issue, for example, is exceeding Requests Per Minute for short periods of time or exceeding search grounding limits.
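
If short bursts above the per-minute limit turn out to be the cause, one option is to smooth them on the client side. The following is a minimal sketch under stated assumptions: `SlidingWindowLimiter` is a hypothetical helper, the `limit` you pass in must come from your own rate limits page, and the class is single-threaded and in-process only (it does not coordinate across workers or machines).

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most `limit` calls in any rolling `window` seconds."""

    def __init__(self, limit, window=60.0, clock=time.monotonic, sleep=time.sleep):
        self.limit = limit
        self.window = window
        self.clock = clock
        self.sleep = sleep
        self.sent = deque()  # timestamps of recent calls, oldest first

    def acquire(self):
        """Block until one more call fits in the window, then record it."""
        while True:
            now = self.clock()
            # Drop timestamps that have aged out of the window.
            while self.sent and now - self.sent[0] >= self.window:
                self.sent.popleft()
            if len(self.sent) < self.limit:
                self.sent.append(now)
                return
            # Wait until the oldest recorded call leaves the window.
            self.sleep(self.window - (now - self.sent[0]))
```

Calling `acquire()` before each request keeps the local request rate under the configured value. It does nothing for 503s, which are returned regardless of your own quota usage.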

lhm · May 19, 2026, 1:59pm

Hi @Jon_Matthews

Thank you for your message.

Unfortunately, we can’t handle our requests through the Batch API, as we require responses within 60 seconds.

Regarding the rate limits, as you can see below, we are still very far from reaching them.

(The drop in request volume over the last few days reflects the migration we already started toward another API service due to the unreliability of Gemini.)
