2.0-flash requests counted as 2.5-flash in quotas

Guillaume_Volumiq · December 9, 2025, 2:52pm

Since this morning, our gemini-2.0-flash requests have been failing with a RESOURCE_EXHAUSTED response, which appears to be triggered by a PerDayPerProjectPerModel quota limit violation.

After reviewing the “Usage and Billing” tab, it appears that all requests were logged as gemini-2.5-flash requests, and the dashboard says we reached its limit, even though the 429 response clearly states we were using 2.0.

The weird part is that even though the model is shown as capped for today, we’re still able to get successful responses when testing gemini-2.5-flash, and the RPD (requests per day) counter keeps incrementing past the limit.

Response:

{
  "quotaMetric": "generativelanguage.googleapis.com/generate_content_free_tier_requests",
  "quotaId": "GenerateRequestsPerDayPerProjectPerModel-FreeTier",
  "quotaDimensions": {
    "location": "global",
    "model": "gemini-2.0-flash"
  }
}
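
For anyone trying to confirm the same mismatch, here is a minimal sketch of how the quota entry could be pulled out of a 429 body and compared with the model that was actually requested. It only assumes that the body is JSON and that each quota entry carries the "quotaId" and "quotaDimensions" keys shown above; the helper names `find_quota_violations` and `mismatched_models` are hypothetical, not part of any SDK, and the search is recursive so it does not rely on a particular error envelope.

```python
import json


def find_quota_violations(payload):
    """Recursively collect every dict that carries a "quotaId" key."""
    found = []
    if isinstance(payload, dict):
        if "quotaId" in payload:
            found.append(payload)
        for value in payload.values():
            found.extend(find_quota_violations(value))
    elif isinstance(payload, list):
        for item in payload:
            found.extend(find_quota_violations(item))
    return found


def mismatched_models(error_body, requested_model):
    """Return quota entries whose "model" dimension differs from the requested model."""
    violations = find_quota_violations(json.loads(error_body))
    return [
        v for v in violations
        if v.get("quotaDimensions", {}).get("model") != requested_model
    ]


# The fragment from the post above, used as sample input.
sample = """
{
  "quotaMetric": "generativelanguage.googleapis.com/generate_content_free_tier_requests",
  "quotaId": "GenerateRequestsPerDayPerProjectPerModel-FreeTier",
  "quotaDimensions": {
    "location": "global",
    "model": "gemini-2.0-flash"
  }
}
"""

for violation in find_quota_violations(json.loads(sample)):
    print(violation["quotaId"], violation["quotaDimensions"]["model"])
```

Logging these fields next to the requested model name on every 429 makes it easier to show which model the API attributed the request to, independently of what the dashboard displays.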

Shivam_Singh2 · December 23, 2025, 5:02am

Hi @Guillaume_Volumiq,
Welcome to the Google AI Forum!

Thank you for bringing this to our attention.
Apologies for the delayed response. Could you please confirm if you are still facing the same problem?
