Issue with 429 Error on Gemini API Despite Staying Within Rate Limits

I am encountering an issue with the Gemini Flash 1.5 API, where I receive a 429 (Too Many Requests) error after making only 2-4 requests, despite the documented rate limit being 2000 requests per minute (RPM) for my paid plan.

I have already attempted adding delays between requests, yet the issue persists. Based on the API documentation, I should be well within the allowed rate limits, but the system is still throttling my requests unexpectedly.

Could you please help me understand:

  1. Whether there are any undocumented limitations or restrictions that might be causing this?
  2. If my account or API key is being incorrectly rate-limited?
  3. Any recommended troubleshooting steps to resolve this issue?
3 Likes

i’m having the same issue. can’t find any solution. on paid tier using gemini-2.0-flash

Yup, same issue on my side. Trying to process documents, at a rate of 1 small 1-page PDF document every 5 minutes, and after 3 documents I get 429 error. Also trying to use gemini-2.0-flash

I’m on a paid GCP account, with a billing profile that has many other GCP projects with paid-for resources that all work perfectly fine.