[Urgent] Critical Discrepancy: High Volume of 429 Errors despite Rate Limit Dashboard showing usage far below quota (Gemini Paid Tier)

Dear Google Cloud Support Team,

I am experiencing a critical issue with the Gemini API within project adsbrain-dev. We are on Paid tier 1, but our application is suffering from a very high rate of 429 TooManyRequests errors, which contradicts the data shown in the Google AI Studio dashboards.

I am attaching two screenshots from the AI Studio dashboard demonstrating this discrepancy:

  1. Attachment 1 (Usage Dashboard): Shows a massive spike in “429 TooManyRequests” errors on Dec 11, reaching thousands of errors.

  2. Attachment 2 (Rate limits breakdown): For the exact same period, the “Peak requests per minute (RPM)” chart shows our usage peaked at only ~64 RPM, which is significantly below our assigned limit of 1,000 RPM for the gemini-2.5-flash model.

The Problem: There is a clear contradiction between your reported usage metrics and the actual API behavior. The dashboard indicates we are well within our safe quota limits (using less than 10% of capacity), yet the API is aggressively throttling our requests with 429 errors.

Request for Investigation: Please investigate the backend logs for project adsbrain-dev to determine the true cause of these 429 errors. Specifically:

  • Is there a hidden burst limit or regional quota that is not reflected in the main dashboard?

  • Is it possible that the project is incorrectly being enforced under “Free Tier” limits (15 RPM) despite being configured for Paid Tier?

This issue is severely impacting our production service. We need an explanation for this discrepancy and a resolution to stop the throttling when we are within our displayed limits.

Thank you.

2 Likes

Hello everyone,

we are experiencing the same issue in our Tier 1 plan for a couple of days now, receiving 429 errors after just a few hundrets API calls to gemini-2.5-pro or gemini-2.5-flash each day.

What we observed (additionally to TECH_MIC_ACE’s description):

  • When reached, the enforced rate limit persists for the rest of the day and seems to reset each night.
  • The displayed RPD for gemini-2.5-pro and for gemini-2.5-flash is at 10K, but we receive 429 before even reaching 1K for both models (see screenshot).

We’re an academic research lab using Gemini for scientific projects. Due to approaching deadlines, we are highly depending on a quick fix for that problem, so we’d appreciate any speedy help!

1 Like

Hey All,

Thank you for flagging this issue. We apologize for the inconvenience and have escalated it to our internal team for investigation. We will update you as soon as we have more information. Could you please provide the project number (not the project ID) via direct message if you have not yet done so?

1 Like

Hi @chunduriv - I am facing the same 429 TooManyRequests issue (way under my rate limits but getting 429s). I have a project number I can share with you, but I’m not seeing how to send you a direct message here. Please advise.

Hi @Nick_Harris,

Welcome to the Forum,

The issue should be fixed. Please check and let us know if the problem persists.

Thank you!

Hi @chunduriv ,

Thanks for looking into the issue. It seemed to be working better for about a day, but now I’m back to the same 429s issue with gemini-3-flash even though I’m well below my rate limit. This is with the same project number I shared with you previously.

Let me know,

Thanks,

Nick

Hi @Nick_Harris,

To help us better understand and resolve your issue, please provide a screenshot of your usage details from https://ai.dev/usage?tab=rate-limit.

Thank you!

Hi @chunduriv

I have hit the limit with gemini-3-pro-image, but not gemini-3-flash, yet I get 429s for both.

Here is my 7 day usage:

Thanks for taking a look,

Nick

Hi @Nick_Harris,

We appreciate you sharing the 7-day usage report. For more targeted troubleshooting, could you please provide the usage details for a 1 day, along with the complete 429 error response?

Thank you!

Gemini HTTP error 429 - You exceeded your current quota, please check your plan and billing details. For more information on this error, head to: https://ai.google.dev/gemini-api/docs/rate-limits. To monitor your current usage, head to: https://ai.dev/rate-limit.

This is happening to me as well with gemini-3-flash even though my rate limits seem fine: