Gemini-2.0-Flash Response is returning "429" even with peak usage to be less than 1%

Xiaoyu_Shawn_Que · May 21, 2025, 7:31am

Hi team,
In recent one week, we’ve been seeing this issue happening multiple times when calling methond google.ai.generativelanguage.v1beta.GenerativeService.StreamGenerateContent. Can you pls check why we’re getting this 429 issue related to model Gemini-2.0-flash-lite?

We’re seeing Error with message in prod 429 RESOURCE_EXHAUSTED. {'error': ('code': 429, 'message': 'Resource has been exhausted (e.g. check quota)', 'status': 'RESOURCE_EXHAUSTED'
When running code:

 response = genai_client.models.generate_content_stream(
                model=model,
                contents=history,
                config=config,
            )
            return 
  self._handle_generate_stream_response(model, credentials, response, prompt_messages)

It’s affecting our prod env but I checked from “API/Service Details” the usage at peak is < 1%. It makes no sense.

Xiaoyu_Shawn_Que · May 21, 2025, 7:33am

chunduriv · June 12, 2025, 12:11am

Hi @Xiaoyu_Shawn_Que,

We have recently updated error message for 429 error that specifies which rate limits are being exceeded. Could you please retry and verify on your end?

If you still face any issues, please feel free to post entire error message, so we can help you better.

Thank you!

Topic		Replies	Views
Getting 429 Errors - But Usage Charts Show no Traffic Gemini API api	53	2349	June 23, 2025
Issue with 429 Error on Gemini API Despite Staying Within Rate Limits Gemini API gemini-api	7	477	June 23, 2025
[FREE tier] Noticeable drop in gemini-2.0-flash throughput (429 errors) Gemini API gemini-api , gemini-20 , rate-limits	1	56	June 17, 2025
Gemini API Errors Gemini API api	9	449	June 24, 2025
Gemini API returns 429 issue when using OpenAI compatible API Gemini API api , generative-ai	7	529	January 26, 2025

Gemini-2.0-Flash Response is returning "429" even with peak usage to be less than 1%

Related topics