Hey! I have the following function which I’m using to create a context cache:
```python
import google.generativeai as genai
from datetime import timedelta
from typing import Any

def create_context_cache(self, video_file: Any) -> Any:
    """Creates a context cache from the video file."""
    try:
        cache = genai.caching.CachedContent.create(
            model=self.model_name_pro,
            system_instruction="You are an AI video editor assistant.",
            contents=[video_file],
            ttl=timedelta(minutes=30),
        )
        print("created context cache")
        return cache
    except Exception as e:
        # Chain the original exception so its traceback is preserved.
        raise RuntimeError(f"Failed to create context cache: {e}") from e
```
When I use `self.model_name_pro = "models/gemini-1.5-pro-001"`, the cache doesn't get created for inputs (~3,500 tokens) that fall below the minimum token requirement of ~32k tokens. However, with `self.model_name_flash = "models/gemini-1.5-pro-002"`, the cache does get created. I thought the minimum token requirement for context caching was the same regardless of model version:
```python
CachedContent(
    name='cachedContents/v4vmw2zafkxl',
    model='models/gemini-1.5-pro-002',
    display_name='',
    usage_metadata={
        'total_token_count': 3549,
    },
    create_time=2024-11-25 00:19:28.067023+00:00,
    update_time=2024-11-25 00:19:28.067023+00:00,
    expire_time=2024-11-25 00:49:27.064035+00:00
)
```
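For context, this is the kind of pre-check I assumed I'd need on my side. The 32,768 figure is just my reading of the docs, and it's exactly the assumption the output above seems to contradict:

```python
# Hypothetical pre-check based on the ~32k minimum I expected; the threshold
# is my assumption, not something the API reports.
CACHE_MIN_TOKENS = 32_768

def meets_cache_minimum(token_count: int, minimum: int = CACHE_MIN_TOKENS) -> bool:
    """Return True if an input of `token_count` tokens should be cacheable."""
    return token_count >= minimum

print(meets_cache_minimum(3549))  # → False, yet the -002 cache above was created
```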