Error: The model is overloaded

David_Richards · February 7, 2025, 1:59am

Same here. Most requests are getting this 503 error. I tried switching to the Vertex AI package (@google-cloud/vertexai) to see if that made a difference, since the @google/generative-ai package seems less actively developed. That seemed to work, but I couldn't fully test it because the API limits unfortunately default to 5 RPM. I filed a quota increase request to match the 1000 RPM you get on the paid tier with the generative-ai package, and will see whether switching makes a difference once that request goes through.
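For anyone else testing against the 5 RPM default, here is a rough sketch of a client-side throttle that spaces request starts at least 60,000 / RPM milliseconds apart. `RpmThrottle` and its methods are names made up for illustration, not part of either SDK, and this says nothing about how the quota is actually enforced server-side.

```typescript
// Hypothetical helper (not from any SDK): spaces request starts so that
// consecutive requests begin at least 60_000 / rpm milliseconds apart.
class RpmThrottle {
  private nextSlotMs = 0;
  private readonly intervalMs: number;

  constructor(rpm: number) {
    if (rpm <= 0) throw new Error("rpm must be positive");
    this.intervalMs = 60_000 / rpm; // 5 RPM -> one request every 12,000 ms
  }

  // Reserves the next free slot and returns how long the caller should wait (ms).
  reserve(nowMs: number = Date.now()): number {
    const slot = Math.max(nowMs, this.nextSlotMs);
    this.nextSlotMs = slot + this.intervalMs;
    return slot - nowMs;
  }

  // Waits until the reserved slot, then runs the request.
  async run<T>(request: () => Promise<T>): Promise<T> {
    const waitMs = this.reserve();
    if (waitMs > 0) {
      await new Promise((resolve) => setTimeout(resolve, waitMs));
    }
    return request();
  }
}
```

Wrapping each SDK call in `throttle.run(() => ...)` should keep a test script from tripping the limit itself, so any remaining errors are easier to attribute to the service.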

All in all, extremely frustrating though. No indication anywhere from Google that there’s an issue, and multiple different poorly documented client SDKs that seemingly have different behaviors.

A few other things I tried with no success:

- switching where I was calling the SDK from to a different region (this fixed a similar temporary issue 6 months ago, no luck this time though)
- switching from “gemini-1.5-pro-latest” to “gemini-1.5-pro-002”
- switching the API version from the default “v1beta” to “v1” (this failed for me because JSON support doesn’t seem to exist in v1, and that’s pretty critical for using this programmatically)

Switching down to gemini-1.5-pro-001 seems to be working for me for now, but given that 002 is in theory a stable model, that’s a pretty poor outcome.
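If you want to automate that fallback instead of hardcoding 001, something along these lines might work: retry overload errors with exponential backoff, then move on to the next model in the list. This is only a sketch. `withModelFallback`, `isOverloaded`, and the `status` field on the error are my own assumptions, not SDK API, so check what your client actually throws on a 503 before relying on the detection logic.

```typescript
// Assumed error shape; the real SDK error type may differ.
interface HttpLikeError {
  status?: number;
  message?: string;
}

// Treats an error as "overloaded" if it carries status 503 or says so in its message.
function isOverloaded(err: unknown): boolean {
  const e = err as HttpLikeError | null | undefined;
  return e?.status === 503 || /503|overloaded/i.test(String(e?.message ?? ""));
}

interface FallbackOptions {
  retriesPerModel?: number; // extra attempts per model after the first
  baseDelayMs?: number;     // first backoff delay; doubles on each retry
}

// Tries each model in order. `callModel` is a caller-supplied wrapper
// around whichever SDK is in use (hypothetical, not an SDK function).
async function withModelFallback<T>(
  models: string[],
  callModel: (model: string) => Promise<T>,
  { retriesPerModel = 2, baseDelayMs = 500 }: FallbackOptions = {},
): Promise<T> {
  let lastError: unknown = new Error("no models given");
  for (const model of models) {
    for (let attempt = 0; attempt <= retriesPerModel; attempt++) {
      try {
        return await callModel(model);
      } catch (err) {
        if (!isOverloaded(err)) throw err; // only overload errors are retried
        lastError = err;
        if (attempt < retriesPerModel) {
          const delayMs = baseDelayMs * 2 ** attempt;
          await new Promise((resolve) => setTimeout(resolve, delayMs));
        }
      }
    }
  }
  throw lastError; // every model stayed overloaded
}
```

Usage would look like `withModelFallback(["gemini-1.5-pro-002", "gemini-1.5-pro-001"], (model) => generate(model, prompt))`, where `generate` stands in for your own function that makes the SDK call.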
