429s in Vertex AI for Gemini-2.5-Flash-Lite in Europe

rp2799 · February 23, 2026, 12:48pm

Hi.

As other posts have noted, there seems to be a persistent bug that leads to 429 errors for europe endpoints. In our case, this is for gemini-2.5-flash-lite.

It makes vertex AI extremely unreliable, and despite exponential backoff - we constantly get ‘too many requests’ and ‘resource exhausted’ for periods of a few hours, which then goes away.

Our code is configured to try all the europe endpoints, yet we get this error more or less regardless of where we try.

Are there any SLAs in place for this, and is this a known issue for which a fix is being deployed? Our customers are unhappy with the latency and we will ultimately switch to another provider if this persists.

duxyz · February 24, 2026, 1:09am

Yes, I have exactly the same issue, no idea what is going on, status shows that it’s all okay but getting 429 on most requests.

How can we get someone to look into this?

mkaloer · February 24, 2026, 8:11am

Same here, 98 % of our gemini requests ended up with status 429 in a period over 12 hours. We have failovers to all EU regions, but that did not help.

rrc102 · February 24, 2026, 1:13pm

And here. I don’t know how you are supposed to use this service for production, it’s a throw of the dice if it’s going to work or not.

f.a · February 24, 2026, 5:37pm

Same problem here! Keep getting 429 while trying to use Gemini-2.5-flash-lite via Vertex AI on paid tier 3. I am using the same failovers approach as @mkaloer but it doesn’t seem to work.
Any update from the technical team would be appreciated!"

mkaloer · February 25, 2026, 5:54am

Hi all,
Got this from Google Cloud support. I’ll keep you updated if I hear more.

Hello ,

Thank you for reaching out. I have taken a closer look and it appears that your issue is related to a product outage that has been resolved as of 2026-02-24 05:37 PST. Our team is still working on investigating the root cause of the issue. I will keep you posted once hearing from our team.

Kora_Rohan · February 25, 2026, 2:24pm

Same, facing this even with US servers. Extremely unreliable

rp2799 · March 2, 2026, 11:59am

Does anyone from Google care to comment on this? Would be nice to understand when this can be resolved.

Dino_Fancellu · March 2, 2026, 2:02pm

I’m getting 503s now for gemini-2.5-flash-lite

ApiError: {“error”:{“code”:503,“message”:“This model is currently experiencing high demand. Spikes in demand are usually temporary. Please try again later.”,“status”:“UNAVAILABLE”}}

It would be nice to have a proper status page which shows true error status and not just the ones they choose to acknowledge

rp2799 · March 30, 2026, 9:13am

if anyone from google cares to respond to us that would be great. otherwise, if you are reading this and are not currently using vertex AI, don’t bother. use something else honestly.

Topic		Replies	Views
Sudden Spike in 429 Errors with Gemini 2.5 via Vertex AI Global Endpoint Gemini API vertexai , gemini	7	1188	April 8, 2026
Unexpected 429 Errors on Vertex AI (Gemini 2.0/2.5 Flash Lite) via Firebase Functions despite <0.1% quota usage Gemini API bug , vertexai , gemini , vertex-ai , firebase	2	267	April 5, 2026
Tier 3 Project – Persistent 503 & 429 Errors in Production (No Communication / Need ETA) Gemini API api	9	219	May 19, 2026
Issue with 429 Error on Gemini API Despite Staying Within Rate Limits Gemini API gemini-api	13	1892	March 10, 2026
Excessive 429 errors today – rate limit or availability issue? Gemini API rate-limits	3	123	May 12, 2026

429s in Vertex AI for Gemini-2.5-Flash-Lite in Europe

Related topics