Do API calls which result in 503 responses count towards the daily quota limit?

I was using the Gemini API and, as usual during rush hours, it returned a lot of 503 (unavailable) responses. Then it suddenly started giving me quota-limit-reached errors, even though I had only received around seven successful responses.

I’m using gemini-2.5-pro on the free tier.

Hi @ronaldo_fenomeno
Welcome to the AI Forum!

This could be a temporary issue occurring during peak usage periods, especially with long-context requests. We recommend temporarily directing traffic to an alternative model to see if the capacity limitation is specific to this model.
For more detailed information, please refer to this document.
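As a rough illustration of the "direct traffic to an alternative model" suggestion, here is a minimal sketch in Python. It is not tied to any particular SDK: `send` is a hypothetical callable you would write yourself around your own client, returning an `(http_status, body)` tuple, and `gemini-2.5-flash` in the usage note is only an example of an alternative model. The retry counts and delays are arbitrary assumptions.

```python
import time
from typing import Callable, Sequence, Tuple


def call_with_fallback(
    send: Callable[[str], Tuple[int, str]],  # hypothetical wrapper: model name -> (status, body)
    models: Sequence[str],
    retries_per_model: int = 2,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[str, str]:
    """Try each model in order; return (model, body) for the first 200 response."""
    for model in models:
        for attempt in range(retries_per_model):
            status, body = send(model)
            if status == 200:
                return model, body
            if status == 429:
                # Quota reached for this model: retrying right away will not help,
                # so move on to the next model in the list.
                break
            if status == 503:
                # Temporarily unavailable: wait with exponential backoff, then retry.
                sleep(base_delay * (2 ** attempt))
                continue
            raise RuntimeError(f"unexpected status {status} from {model}: {body}")
    raise RuntimeError("no model returned a successful response")
```

You would call it with something like `call_with_fallback(my_send, ["gemini-2.5-pro", "gemini-2.5-flash"])`, where `my_send` is your own function. Keeping `retries_per_model` small limits how many failed attempts you generate while the model is overloaded.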

Of the 50 requests I sent today, I was able to receive three responses. The remaining 47 were 503 errors. The model I’m using is gemini-2.5-pro.

I seem to be having the same issue with the free API.

Either this is temporary, or they’re no longer offering photo creation.

API Gemini (HTTP 429). Message: You exceeded your current quota, please check your plan and billing details. For more information on this error, head to: https://ai.google.dev/gemini-api/docs/rate-limits. To monitor your current usage, head to: https://ai.dev/usage?tab=rate-limit.

Hi @Zafaraka_Man

Please follow the instructions below to troubleshoot the issue:

  1. Access the GCP Console and navigate to APIs & Services.
  2. Under Metrics, search for and select Generative Language API.
  3. Go to the Quotas & System Limits tab and review the Current Usage Percentage.

If the usage reaches 100%, it indicates that your quota limit has been reached, which is likely the cause of the 429 error.
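To have something to compare against the usage figure in the console, it may help to keep your own tally of attempts by HTTP status. The sketch below is only a local counter (the class name `RequestTally` is made up for illustration). It does not tell you how the API itself counts 503 responses, but if the console usage is closer to your total attempts than to your successful responses, that is a hint that failed calls are being counted.

```python
from collections import Counter


class RequestTally:
    """Local count of API attempts, grouped by HTTP status code."""

    def __init__(self) -> None:
        self.by_status: Counter = Counter()

    def record(self, status: int) -> None:
        # Call this once per request, whatever the outcome.
        self.by_status[status] += 1

    @property
    def total(self) -> int:
        return sum(self.by_status.values())

    def summary(self) -> str:
        parts = [f"{code}: {count}" for code, count in sorted(self.by_status.items())]
        return f"total attempts: {self.total} ({', '.join(parts)})"
```

For example, recording three 200s and forty-seven 503s gives a total of 50 attempts, which you can then set against the Current Usage Percentage shown under Quotas & System Limits.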

If you have any questions regarding pricing, please refer to this document.
