Is there any way to contact Google and request higher rate limits for Gemini? Our application is bursty. We might send 20M tokens as fast as we can and then nothing for minutes. The limit of 4M tokens per minute is killing us.
Hi,
Welcome to the forum.
If you’re using Vertex AI, you can request a quota increase in Google Cloud Platform.
Cheers
Is Vertex AI different from AI Studio?
Hi,
Yes, it’s more enterprise-focused. I see AI Studio more as a prototyping playground.
Cheers
Hi @Scott_Swigart,
Sorry for the late response. Each model variation has an associated rate limit (requests per minute, RPM). For details on those rate limits, see Gemini models.
Please fill out the "Request paid tier rate limit increase" form if you’d like to request a rate limit increase for your Gemini API project.
We offer no guarantees about increasing your rate limit, but we’ll do our best to review your request and reach out to you if we’re able to accommodate your capacity needs.
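While a request is pending, a bursty client can also pace itself so it stays under the per-minute token cap instead of running into rate-limit errors. Below is a minimal sketch of a client-side token bucket in Python, assuming the 4M tokens-per-minute figure from the original question. `TokenBudget`, `send_all`, and the `send` and `count_tokens` callables are hypothetical names for illustration, not part of any Google SDK, and the way the service actually measures its window may differ.

```python
import time


class TokenBudget:
    """Client-side token bucket (hypothetical helper, not part of any SDK)."""

    def __init__(self, tokens_per_minute, clock=time.monotonic, sleep=time.sleep):
        self.capacity = float(tokens_per_minute)  # assumed burst size: one minute of quota
        self.rate = tokens_per_minute / 60.0      # tokens refilled per second
        self.available = self.capacity            # start with a full bucket
        self.clock = clock
        self.sleep = sleep
        self.last = clock()

    def _refill(self):
        # Credit the tokens earned since the last check, capped at capacity.
        now = self.clock()
        self.available = min(self.capacity,
                             self.available + (now - self.last) * self.rate)
        self.last = now

    def acquire(self, tokens):
        """Block until `tokens` fit in the budget, then spend them."""
        if tokens > self.capacity:
            raise ValueError("a single request exceeds the per-minute limit")
        self._refill()
        while self.available < tokens:
            wait = (tokens - self.available) / self.rate
            self.sleep(max(wait, 0.001))  # never sleep less than 1 ms
            self._refill()
        self.available -= tokens


def send_all(prompts, budget, send, count_tokens):
    """Send each prompt once its estimated token cost fits in the budget.

    `send` and `count_tokens` are caller-supplied callables (assumptions).
    """
    results = []
    for prompt in prompts:
        budget.acquire(count_tokens(prompt))
        results.append(send(prompt))
    return results
```

With `TokenBudget(4_000_000)`, a 20M-token burst still takes several minutes to drain, because no client-side pacing can exceed the quota itself. If the service counts tokens over fixed windows, a full starting bucket plus refill can overshoot within one minute, so a lower rate or a retry on rate-limit errors may still be needed.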
Thank you!