Model is overloaded -

nrathi · April 30, 2025, 3:11pm

I’m seeing the same, since 6am Eastern today.

Is this the right status page? There has not been an incident report. Google AI Studio

zlmk · April 30, 2025, 4:40pm

We’re paid tier 1. Same issue.

Jono_Stiansen · April 30, 2025, 7:53pm

I have never seen an issue produced for gemini outages, it’s frustrating. I’d recommend filing support tickets to put some pressure on GCP to take these outages seriously. You can go to your projects support @ https://console.cloud.google.com/support/cases

zlmk · April 30, 2025, 8:22pm

Which product to choose? I am not seeing a non-Vertex-AI Generative AI / Gemini API product listed there.

netlifedev1 · April 30, 2025, 11:34pm

Its really frustrating when this occurs, and completely puts me off utilising in the case of production applications. I am having to write custom monitoring specifically for the Gemini models so that I can track and report these incidents internally.

I really feel that Google should consider including model overloads as an outage category.

We are running at 5% our assigned Gemini capacity (paid user with agreed raised tiers).

Its Gemini 2.0 Flash having the issue.

Gemini 2.5 Flash is not overloaded still, but we can’t switch to it as it underperforms for our use cases.

zlmk · May 1, 2025, 11:49am

Totally agree. We’ve been relying on Gemini 2.0 Flash on some critical tasks and it’s outperforming 2.5 Flash AND 2.5 Pro.

I don’t know what happens to Google, but they haven’t been really showing much care about the developer community. At OpenAI’s Dev forum, there are a bunch of OpenAI engineers and PMs helping and listening.

Google, if you are listening, talk to us about what use cases we care about and train/release models better than previous models not worse. Do NOT overfit to benchmarks!

Vishal · May 1, 2025, 10:18pm

Hey folks, sorry for the issues here. The 2.0 Flash issue should be resolved, please let me know if not.

Tiago_Melo · May 2, 2025, 1:02pm

When can we expect the issue to be solved for larger, more recent models like 2.5 Pro? Right now, if I try to parse a document with >8 pages I have to implement retries with some exponential backoff, with delays hitting more than 1h since the server keeps disconnecting. This makes building production apps with Google Gemini impossible

Pratik_Behera · May 14, 2025, 9:20am

Hey Vishal
I am still facing the issue with Gemini-2.0-flash. I am using retries and backoff with 5 and 5 values.

503 UNAVAILABLE. {'error': {'code': 503, 'message': 'The model is overloaded. Please try again later.', 'status': 'UNAVAILABLE'}}

Could you please tell, by when will this be resolved

Thanks a lot

Pau_Monserrat · June 3, 2025, 10:59am

Hello folks!

Is someone experiencing error 503 with the Gemini API these past days and also today? I went to the status webpage (Google AI Studio), where it says the API have been working well the last days.

Pau_Monserrat · June 3, 2025, 11:14am

By the way, the model I’m facing more problems with is Gemini 2.0 Flash.

Ben_Manashe · June 3, 2025, 12:00pm

Yeah I’m seeing the same, today in particular…basically not working at all! I’ve been testing for bugs on my side, but can’t see anything besides the 503 and the service being overloaded/unavailable…anyone else?

Gustavo_Herrera · June 3, 2025, 2:55pm

Same for me. Swapping from gemini20Flash to gemini20FlashLite solved the problem (for now).
Seriously considering switching to OpenAI.

AleixGG · June 3, 2025, 3:17pm

Same here, having problems since 2-3pm CET

Jesus_Abril · November 7, 2025, 4:04pm

Is a bit annoying, manny often.
The model is overloaded. Please try again later

dhnation · January 28, 2026, 4:02pm

Yes, the only workaround I have found is to use other models from other companies as a fallback. It wouldn’t give the same quality, but at least it’ll return something. For example, you could use Chinese models, or for upscaling purposes Runware. And if there will come a model from another company than Google, that is more durable, we could simply move entirely to those. It’s ridiculous that we are STILL facing this error, I just had it a minute ago once again.

dhnation · January 28, 2026, 4:04pm

It’s frustrating to see, that this topic was opened on January 2025. We almost have February 2026 and this error STILL occurs.

Topic		Replies	Views
Error: The model is overloaded Gemini API model	63	41603	January 28, 2026
Frequent 503 "The model is overloaded" errors on Gemini 2.5 Flash Gemini API model , gemini-flash-2-5	19	2513	January 26, 2026
Getting a lot of "service unavailable" errors on gemini-2.0-flash Gemini API api , gemini-flash , gemini-20	21	1726	November 7, 2025
Gemini 2.5 Pro: The model is overloaded. Please try again later Gemini API model , gemini_25_pro	17	2757	November 17, 2025
Every day 503 errors with msg model is overloaded Gemini API api , model	5	461	August 23, 2025

Model is overloaded -

Related topics