503 "The model is overloaded" is being counted against the rate limits

Flipp_Fuzz · November 16, 2025, 1:59pm

Background

I was fiddling around with an application that auto-retries upon failure.

Error received - Model is Overloaded

I’m OK with this and will simply retry.

google.genai.errors.ServerError: 503 UNAVAILABLE. {'error': {'code': 503, 'message': 'The model is overloaded. Please try again later.', 'status': 'UNAVAILABLE'}}

After a bunch of iterations, I unexpectedly got quota exceeded

google.genai.errors.ClientError: 429 RESOURCE_EXHAUSTED. {'error': {'code': 429, 'message': 'You exceeded your current quota, please check your plan and billing details. For more information on this error, head to: https://ai.google.dev/gemini-api/docs/rate-limits. To monitor your current usage, head to: https://ai.dev/usage?tab=rate-limit. \n* Quota exceeded for metric: generativelanguage.googleapis.com/generate_content_free_tier_requests, limit: 50, model: gemini-2.5-pro\nPlease retry in 48.805646254s.', 'status': 'RESOURCE_EXHAUSTED', 'details': [{'@type': 'type.googleapis.com/google.rpc.Help', 'links': [{'description': 'Learn more about Gemini API quotas', 'url': 'https://ai.google.dev/gemini-api/docs/rate-limits'}]}, {'@type': 'type.googleapis.com/google.rpc.QuotaFailure', 'violations': [{'quotaMetric': 'generativelanguage.googleapis.com/generate_content_free_tier_requests', 'quotaId': 'GenerateRequestsPerDayPerProjectPerModel-FreeTier', 'quotaDimensions': {'model': 'gemini-2.5-pro', 'location': 'global'}, 'quotaValue': '50'}]}, {'@type': 'type.googleapis.com/google.rpc.RetryInfo', 'retryDelay': '48s'}]}}

But, if you look at the usage dashboard on AI studio, 31/50 of the requests were due to “The model is overloaded”

Questions

Shouldn’t “The model is overloaded“ not be counted against the limits?
If I transition to a paid tier, would I be billed for 503?

These are errors on Google’s end and not the user.

Setapca · November 16, 2025, 2:54pm

Every time you send a request and receive a 503 response, Google records it as successful and counts your quota.

Shivam_Singh2 · December 3, 2025, 5:58am

Hii @Flipp_Fuzz

Thank you for reaching out to us.

Could you please let us know which model you are using?? This will help us better understand and analyze the problem so we can provide you with a more accurate and helpful response.

Flipp_Fuzz · December 3, 2025, 6:26am

It is in the image. gemini-2.5-pro

Pooja_Kapse · January 9, 2026, 2:35pm

Hi @Flipp_Fuzz and @Setapca,
Thanks for reporting this issue.
Could you please confirm, if you are still facing this issue?

Topic		Replies	Views
Gemini 503 APIError: 503 UNAVAILABLE and Gemini quota exceeded: 429 RESOURCE_EXHAUSTED Gemini API ai-studio , api , models , gemini	1	256	November 26, 2025
Do API calls which result in 503 responses count towards the daily quota limit? Gemini API gemini-2-5	4	175	November 24, 2025
Every day 503 errors with msg model is overloaded Gemini API api , model	5	457	August 23, 2025
You exceeded your current quota, please check your plan and billing details Gemini API api , billing	4	2277	January 22, 2026
Model is overloaded? gemini-2.5-pro Gemini API gemini_25_pro , gemini-flash-2-5	1	174	October 28, 2025