Google AI Developers Forum

BUG: 429 RESOURCE_EXHAUSTED on Paid Tier 1 (Gemini 2.5 Flash) despite <1% Quota Usage

jimmy_smith March 9, 2026, 5:21am 1

To the Google AI Team,

I am experiencing a severe backend quota synchronization bug on my Paid Tier 1 account that is completely halting my production environment.

My script is making calls to the gemini-2.5-flash model. My project is linked to an active, paid billing account. According to my Google Cloud API Quota Dashboard, I am well under all limits:

Peak RPM: 4 / 1,000
Peak TPM: 389.74K / 1,000,000

Despite being at less than 1% of my RPM limit and 39% of my TPM limit, the API is aggressively throwing 429 RESOURCE_EXHAUSTED errors. The server is completely rejecting burst traffic (even as low as 2-3 rapid requests) and forcing a 60 to 120-second timeout before accepting a single new request.

This is clearly the known “Ghost 429” dynamic quota bug affecting Tier 1 accounts, where the edge servers are enforcing an invisible, hyper-restrictive rate limit that ignores the 1,000 RPM / 1M TPM quota displayed on my dashboard.

My Details:

Model: gemini-2.5-flash
Error: 429 RESOURCE_EXHAUSTED

Please manually investigate my Project ID, lift the probationary/burst throttles on the backend, and sync my actual server allocation to match my Tier 1 dashboard limits so I can resume processing.

Thank you.

NATTAPON_THONGPON March 9, 2026, 5:32am 2

To the Google AI Team,

I am experiencing a severe backend quota synchronization bug on my Paid Tier 1 account that is completely halting my production environment.

My script is making calls to the gemini-2.5-flash model. My project is linked to an active, paid billing account. According to my Google Cloud API Quota Dashboard, I am well under all limits:

Peak RPM: 4 / 1,000
Peak TPM: 389.74K / 1,000,000

Despite being at less than 1% of my RPM limit and 39% of my TPM limit, the API is aggressively throwing 429 RESOURCE_EXHAUSTED errors. The server is completely rejecting burst traffic (even as low as 2-3 rapid requests) and forcing a 60 to 120-second timeout before accepting a single new request.

This is clearly the known “Ghost 429” dynamic quota bug affecting Tier 1 accounts, where the edge servers are enforcing an invisible, hyper-restrictive rate limit that ignores the 1,000 RPM / 1M TPM quota displayed on my dashboard.

My Details:

Model: gemini-2.5-flash
Error: 429 RESOURCE_EXHAUSTED

Please manually investigate my Project ID, lift the probationary/burst throttles on the backend, and sync my actual server allocation to match my Tier 1 dashboard limits so I can resume processing.

Thank you.

Topic		Replies	Views	Activity
[BUG] Persistent 429 Errors on Paid Tier 1 Despite <10% Quota Usage (gemini-2.5-flash) Gemini API bug , api , gemini-flash-2-5	0	81	March 29, 2026
Tier 1 Postpay account getting 100% 429 RESOURCE_EXHAUSTED with no error.details[] for 24h+ Gemini API billing , rate-limits	3	202	April 22, 2026
Gemini API 429 Error Despite Low Quota Usage on Paid Tier (gemini-2.5-flash) Gemini API bug , rate-limits	40	2855	May 3, 2026
Persistent 429 RESOURCE_EXHAUSTED error on Tier 1, well below quota limits (gemini-2.5-flash) Gemini API gemini-flash	2	201	December 18, 2025
Issue: Persistent 429 RESOURCE_EXHAUSTED on Paid Tier 1 Gemini API rate-limits	1	197	March 13, 2026