Penalty for reaching quota in pay-as-you-go with a fine-tuned model?

wije · October 26, 2024, 5:26am

I have this fined tuned model based off of gemini-1.5-flash that I want hit using the API. Thinking that the RPM was 2000, I instructed my code to make about 1500 request per minute. Long story short I experienced a lot of RESOURCE_EXHAUSTED errors and soon only RESOURCE_EXHAUSTED errors - even with a much lower rpm and very patient try and retry logic. I then gave it a 12 hour rest and made a single request, RESOURCE_EXHAUSTED.

I have since learned that the RPM is very likely 360 for this model. Fine, I can work with that. But when can I resume using the model at the lower rate? AM I being penalised for X amount of time for reaching the quota or there is some other limitation in place? Like, maybe I have also reached some unknown daily max request?

Can anyone shed some light on this?

afirstenberg · October 26, 2024, 12:54pm

There shouldn’t be a request-per-day limit on the paid tier.
But if there is, it resets at midnight US West Coast time (ie - the time in Mountain View, CA).

wije · October 27, 2024, 7:47pm

Update: Gave this another go 12h later again and stayed well within the quota, about 300 RPM. This worked for bit but eventually all I got was RESOURCE_EXHAUSTED. The only conclusion is that there is some other limitation than RPM. I just want to know what it is so I can make plans!

Topic		Replies	Views
RESOURCE_EXHAUSTED when try to use gemini-1.5-pro-002 via API Gemini API gemini-15 , ai-studio , api	3	516	October 5, 2024
Gemini-1.5-pro-002 quotas lower than 001 Gemini API gemini-15 , vertexai	7	1305	November 19, 2024
Gemini API Free Tier Daily Quota (25 RPD) Blocking Paid Usage (Tier 1 - 1000 RPD) Gemini API api , gemini	1	149	April 17, 2025
10 RPM quota being applied to Paid tier (gemini-2.0-flash-lite-preview-02-05) Google AI Studio api , models	6	412	February 18, 2025
429 Resource has been exhausted even enrolled in paid and within quota Gemini API	7	488	October 5, 2024

Penalty for reaching quota in pay-as-you-go with a fine-tuned model?

Related topics