Request: Exclude failed high-traffic requests from Gemini Pro quota usage

I am currently using the Gemini Pro plan and have encountered a recurring issue.

When sending requests in Google Antigravity IDE, I frequently receive the following error:

“Our servers are experiencing high traffic right now, please try again in a minute.”

However, even though the request fails due to server-side high traffic and no response is generated, the request is still deducted from my allocated Gemini Pro quota.

This effectively reduces my available usage without receiving any result, which significantly impacts my ability to work productively within the assigned limits.

Since these failures are clearly caused by server-side load and not by successful model execution, I kindly request:

That requests resulting in high-traffic server errors are not counted against the Gemini Pro quota.

At the moment, with the current limits and server instability, the Gemini Pro plan becomes difficult to use effectively, even with moderate usage.

Thank you in advance.
Best regards,

Hi @Yes,

Thank you for your valuable feedback. We sincerely appreciate you taking the time to share your thoughts with the community. Your input is crucial for our continuous improvement and we have shared it directly with the relevant internal team.

During the extended outage yesterday, when Antigravity tried automatically over and over to submit the request, only to fail each time, I noticed that it wasted over 40% of my request quota (that is already extremely low since the length of time that they refresh seems to be randomly reduced these days). The quota that was wasted did not seem to be automatically returned today and these kinds of nuances are what make people not want to subscribe to higher tiers, use AI Credits for work, and make other offerings from competitors more attractive.