[Urgent] Critical Discrepancy: High Volume of 429 Errors despite Rate Limit Dashboard showing usage far below quota (Gemini Paid Tier)

TECH_MIC_ACE · December 15, 2025, 4:11am

Dear Google Cloud Support Team,

I am experiencing a critical issue with the Gemini API within project adsbrain-dev. We are on Paid tier 1, but our application is suffering from a very high rate of 429 TooManyRequests errors, which contradicts the data shown in the Google AI Studio dashboards.

I am attaching two screenshots from the AI Studio dashboard demonstrating this discrepancy:

Attachment 1 (Usage Dashboard): Shows a massive spike in “429 TooManyRequests” errors on Dec 11, reaching thousands of errors.

aistudio11313×934 63.8 KB
Attachment 2 (Rate limits breakdown): For the exact same period, the “Peak requests per minute (RPM)” chart shows our usage peaked at only ~64 RPM, which is significantly below our assigned limit of 1,000 RPM for the gemini-2.5-flash model.

aistudio21313×812 52.7 KB

The Problem: There is a clear contradiction between your reported usage metrics and the actual API behavior. The dashboard indicates we are well within our safe quota limits (using less than 10% of capacity), yet the API is aggressively throttling our requests with 429 errors.

Request for Investigation: Please investigate the backend logs for project adsbrain-dev to determine the true cause of these 429 errors. Specifically:

Is there a hidden burst limit or regional quota that is not reflected in the main dashboard?
Is it possible that the project is incorrectly being enforced under “Free Tier” limits (15 RPM) despite being configured for Paid Tier?

This issue is severely impacting our production service. We need an explanation for this discrepancy and a resolution to stop the throttling when we are within our displayed limits.

Thank you.

MaggiR · December 15, 2025, 10:18pm

Hello everyone,

we are experiencing the same issue in our Tier 1 plan for a couple of days now, receiving 429 errors after just a few hundrets API calls to gemini-2.5-pro or gemini-2.5-flash each day.

What we observed (additionally to TECH_MIC_ACE’s description):

When reached, the enforced rate limit persists for the rest of the day and seems to reset each night.
The displayed RPD for gemini-2.5-pro and for gemini-2.5-flash is at 10K, but we receive 429 before even reaching 1K for both models (see screenshot).

We’re an academic research lab using Gemini for scientific projects. Due to approaching deadlines, we are highly depending on a quick fix for that problem, so we’d appreciate any speedy help!

chunduriv · December 16, 2025, 11:58pm

Hey All,

Thank you for flagging this issue. We apologize for the inconvenience and have escalated it to our internal team for investigation. We will update you as soon as we have more information. Could you please provide the project number (not the project ID) via direct message if you have not yet done so?

Nick_Harris · January 8, 2026, 5:19am

Hi @chunduriv - I am facing the same 429 TooManyRequests issue (way under my rate limits but getting 429s). I have a project number I can share with you, but I’m not seeing how to send you a direct message here. Please advise.

chunduriv · January 9, 2026, 12:02am

Hi @Nick_Harris,

Welcome to the Forum,

The issue should be fixed. Please check and let us know if the problem persists.

Thank you!

Nick_Harris · January 12, 2026, 6:00am

Hi @chunduriv ,

Thanks for looking into the issue. It seemed to be working better for about a day, but now I’m back to the same 429s issue with gemini-3-flash even though I’m well below my rate limit. This is with the same project number I shared with you previously.

Let me know,

Thanks,

Nick

chunduriv · January 12, 2026, 7:37pm

Hi @Nick_Harris,

To help us better understand and resolve your issue, please provide a screenshot of your usage details from https://ai.dev/usage?tab=rate-limit.

Thank you!

Nick_Harris · January 12, 2026, 8:28pm

Hi @chunduriv

I have hit the limit with gemini-3-pro-image, but not gemini-3-flash, yet I get 429s for both.

Here is my 7 day usage:

Thanks for taking a look,

Nick

chunduriv · January 12, 2026, 8:43pm

Hi @Nick_Harris,

We appreciate you sharing the 7-day usage report. For more targeted troubleshooting, could you please provide the usage details for a 1 day, along with the complete 429 error response?

Thank you!

Nick_Harris · January 12, 2026, 9:04pm

Gemini HTTP error 429 - You exceeded your current quota, please check your plan and billing details. For more information on this error, head to: https://ai.google.dev/gemini-api/docs/rate-limits. To monitor your current usage, head to: https://ai.dev/rate-limit.

Ben_Ciccarelli · January 13, 2026, 9:12pm

This is happening to me as well with gemini-3-flash even though my rate limits seem fine:

Simon_O_Magus · February 13, 2026, 5:39pm

my TPM limit is stuck. I know I didn’t exceed it this time. I keep getting rate limit blocked for days. I was told the google employees are manually fixing this problem for people on here. Please fix my rate limit issue.

I am not hitting even hit the TPM like it shows. but, its completely locked for days, nothing but these 429 errors. This has happened 3 or 4 times already and now I know it’s not my fault so please fix it.

Bill_Gustin · February 26, 2026, 4:41pm

I am hitting 429 errors even though I am on a paid tier and way below the limit. Please let me know what info you need from me to resolve.

ilatypov · April 21, 2026, 7:51pm

I’m on Tier 2, below limits, but getting 429 errors. When will this be finally fixed?

Jeet_Shah · April 22, 2026, 12:34pm

Fix is to upgrade your models. from 2.0 to 2.5

@ilatypov

emaktech · April 23, 2026, 2:29am

I started getting the same in the last week or so on Gemini 2.0 Flash.

Same thing - 429s at less than 5% of usage limit.

Honestly no idea what Google is doing because they guarantee it to be a stable endpoint until June 1st 2026. Not a good look.

Then 2.5 Flash is the only other comparable model marked as stable right now but it’s deprecating in July. Horrible communication and dismal support for products that, according to roadmaps should be supported as production stable.

And then what will be left to move to? 3.1 Flash Lite? Nothing meets intelligence and latency of 2.0 Flash. If we lose this and 2.5 and are forced into 3.x series models as they exist today I don’t think Google will have a single actually low latency model. Obviously will have to look elsewhere because they are abandoning the low latency intelligence space. Pretty disappointing.

Mahesh_Sutar · April 23, 2026, 6:05am

Hi @ilatypov ,
Can you DM your project numbers (not the Project ID) and models you are using . To send a direct message, click on my profile picture or name, and select the Message button.

ilatypov · April 23, 2026, 9:45am

I changed model to 2.5 and it helped. I do not get 429 errors anymore. Thank you!

Jeet_Shah · April 23, 2026, 9:58am

If you are using this API key for production, i would suggest you to migrate to Vertex AI , same costing and more uptime guaranteed.

Jon_Matthews · May 8, 2026, 2:57pm

If you’re receiving 429’s and have checked no rate limit was breached for the period, please add your details to this form & we’ll check it out.

Topic		Replies	Views
Gemini API 429 Error Despite Low Quota Usage on Paid Tier (gemini-2.5-flash) Gemini API bug , rate-limits	40	2651	May 3, 2026
429 error with quota with tier Gemini API ai-studio , api , gemini	45	2195	March 12, 2026
Issue with 429 Error on Gemini API Despite Staying Within Rate Limits Gemini API gemini-api	13	1899	March 10, 2026
Critical Discrepancy: High 429 Errors despite being <10% of Paid Tier Quota (Project: aigrademypaper)) Google AI Studio ai-studio , api , gemini , rate-limits	0	32	May 11, 2026
Critical Issue 429 Rate Limit Errors on Tier 3 Account with low traffic Gemini API ai-studio , gemini , gemini-flash-2-5	0	103	February 4, 2026

[Urgent] Critical Discrepancy: High Volume of 429 Errors despite Rate Limit Dashboard showing usage far below quota (Gemini Paid Tier)

Related topics