Standard retry logic for Gemini Python SDK?

hie · August 10, 2024, 2:17pm

Does anyone know of a repeatable, official, retry logic for the Gemini API? I see 100s of code “cookbook” examples, scant return codes listed in the official API docs, but no robust examples of how to handle the various codes that are returned (429,503, etc.).

This seems to work for 429 responses, but not for 503 responses:

from google.generativeai.types import RequestOptions
from google.api_core import retry

...

response = model.generate_content(user_message,
                                      request_options=RequestOptions(
                                        retry=retry.Retry(
                                            initial=10, 
                                            multiplier=2, 
                                            maximum=60, 
                                            timeout=300
                                        )
                                       )
                                    )

Govind_Keshari · February 21, 2025, 1:43pm

Hey @hie,

Use below retry code :

import google.generativeai as genai
from google.api_core import retry

model = genai.GenerativeModel('gemini-2.0-flash')

# For convenience, a simple wrapper to let the SDK handle error retries
def generate_with_retry(model, prompt):
  return model.generate_content(prompt, request_options={'retry':retry.Retry()})

I think the default limit is 5min, but it’s configurable.

Topic		Replies	Views
How to Implement Retry Logic in the New Python SDK? Gemini API api , python	1	80	May 13, 2025
❌ ERROR Resource has been exhausted (e.g. check quota) Gemini1.5-pro Gemini API gemini-15 , models	2	549	August 11, 2024
Why always getting Status 429? Very frustrating Gemini API	18	2914	August 10, 2024
Gemini 2.0 timing out Gemini API gemini-api , model	2	873	May 21, 2025
Wierd Error when calling api Google AI Studio api , python-library	3	139	September 21, 2024

Standard retry logic for Gemini Python SDK?

Related topics