[FREE tier] Noticeable drop in gemini-2.0-flash throughput (429 errors)

podgancar · June 15, 2025, 9:54am

In the past week, we’ve started getting a lot of 429 errors, which we never saw before. Previously we could easily process 1,000 items, but now the API struggles to handle even 40 at a time. I’ve attached graphs that show our usage over time. We already throttle our requests according to the rate limits listed on the “Rate limits | Gemini API | Google AI for Developers” page, and our usage is still within the limits of the free tier. We’re seeing the same issue with other models as well. We’ve also noticed that paid users report the same problems, so we don’t see a point in upgrading. Is there anything we can do?
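For anyone comparing notes, client-side throttling of the kind described above might look roughly like the sketch below. It is only an illustration, not our production code: the class name `SlidingWindowLimiter` and its parameters are hypothetical, and the actual requests-per-minute value has to come from the rate limits page for your model and tier.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most `max_calls` calls in any rolling `period` seconds.

    Hypothetical helper for illustration; not part of any Gemini SDK.
    """

    def __init__(self, max_calls, period=60.0, clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.period = period
        self.clock = clock    # injectable so the limiter can be tested without waiting
        self.sleep = sleep
        self.calls = deque()  # timestamps of calls still inside the window

    def acquire(self):
        """Block until another call is allowed, then record it."""
        while True:
            now = self.clock()
            # Drop timestamps that have left the rolling window.
            while self.calls and now - self.calls[0] >= self.period:
                self.calls.popleft()
            if len(self.calls) < self.max_calls:
                self.calls.append(now)
                return
            # Window is full: wait until the oldest call expires.
            self.sleep(self.period - (now - self.calls[0]))
```

Calling `limiter.acquire()` right before each API request keeps the client under the configured ceiling. Note that this only guards the per-minute request count; per-day and token-based limits would need separate tracking.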

@Krish_Varnakavi1 let me know if you guys need any data or help in debugging this issue.

Krish_Varnakavi1 · June 17, 2025, 11:55pm

Hi @podgancar,

Welcome to the Google AI Forum!

To troubleshoot this issue, please follow the instructions below:

Go to the GCP console and click “APIs & Services”. Under Metric, search for and select “Generative Language API”. Under the “Quotas & System Limits” tab, check the “Current Usage percentage”.

If it reaches 100%, then you have reached your quota limits.
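While the quota is exhausted, a common client-side mitigation is to retry with exponential backoff and jitter rather than resending immediately. The snippet below is a minimal sketch under stated assumptions: `RateLimitError` is a hypothetical stand-in for whatever exception your client raises on HTTP 429, and `call_with_backoff` is an illustrative helper, not an official API.

```python
import random
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for the exception your client raises on HTTP 429."""


def call_with_backoff(fn, max_attempts=5, base_delay=1.0, max_delay=60.0, sleep=time.sleep):
    """Call `fn`, retrying on RateLimitError with exponential backoff and full jitter."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except RateLimitError:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error to the caller
            # The delay ceiling doubles each attempt, capped at max_delay.
            ceiling = min(max_delay, base_delay * 2 ** attempt)
            # Full jitter: sleep a random amount up to the ceiling.
            sleep(random.uniform(0, ceiling))
```

Backoff will not help if a daily quota is already spent, so it is still worth checking the usage percentage described above.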

If you think that there is any discrepancy, please DM me with a clear error message and Project ID to help us investigate further.
