Did My Vertex AI Input Caching Fail?

I'm asking because it seems that caching didn't work, and that could cost me a lot of money.

The caching documentation says input caching is enabled by default.

I am sure that default input caching is enabled, but when I check response.usage_metadata, it seems that it didn't work.

Here is the code:

from google import genai
from google.genai import types

gemini_client = genai.Client(
    vertexai=True,
    project='gen-lang-client-0021737048',
    location='us-central1',
    http_options=types.HttpOptions(api_version='v1')
)

chat_session = gemini_client.chats.create(model="gemini-2.0-flash-001", history=[])

resp1 = chat_session.send_message(message="hi, I am john")
print(resp1.usage_metadata.total_token_count)  # 21

resp2 = chat_session.send_message(message="Do u know my name?")
print(resp2.usage_metadata.cached_content_token_count)  # it should be 21, but it is None

Thank you!


@Long_Peng, the minimum prompt length to trigger context caching is 4,096 tokens. Please refer to the documentation for further details.
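
For prompts above that threshold, you can also create an explicit cache with the same SDK. A minimal sketch, where the file name, TTL, and question text are placeholders, and the cached corpus must be at least 4,096 tokens for this model:

from google import genai
from google.genai import types

client = genai.Client(
    vertexai=True,
    project='gen-lang-client-0021737048',
    location='us-central1',
)

# Placeholder corpus; it must meet the 4,096-token minimum.
big_context = open('large_corpus.txt').read()

# Create the cache once, up front.
cache = client.caches.create(
    model='gemini-2.0-flash-001',
    config=types.CreateCachedContentConfig(
        contents=[
            types.Content(role='user', parts=[types.Part.from_text(text=big_context)])
        ],
        display_name='example-cache',
        ttl='3600s',  # keep the cached tokens for one hour
    ),
)

# Later requests reference the cache instead of resending the corpus.
resp = client.models.generate_content(
    model='gemini-2.0-flash-001',
    contents='What does the corpus say about pricing?',
    config=types.GenerateContentConfig(cached_content=cache.name),
)
print(resp.usage_metadata.cached_content_token_count)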


Thank you. Can default caching be used to cache conversation history the way OpenAI/Claude/DeepSeek do? If not, how can I cache a conversation like those providers?
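
For what it's worth, here is a minimal sketch of that idea, assuming the default caching described above applies once the conversation's repeated prefix clears the 4,096-token minimum. reference_doc.txt and the prompts are placeholders, and whether a given request actually hits the cache is decided by the backend:

from google import genai
from google.genai import types

client = genai.Client(
    vertexai=True,
    project='gen-lang-client-0021737048',
    location='us-central1',
)

# Seed the chat with a long, stable prefix; short greetings like the
# 21-token example above are far below the cacheable minimum.
long_document = open('reference_doc.txt').read()
chat = client.chats.create(
    model='gemini-2.0-flash-001',
    history=[
        types.Content(role='user', parts=[types.Part.from_text(text=long_document)]),
        types.Content(role='model', parts=[types.Part.from_text(text='Understood.')]),
    ],
)

resp1 = chat.send_message(message='Summarize section 1.')
print(resp1.usage_metadata.cached_content_token_count)  # often None on a cold cache

# The second turn resends the same prefix (history + first turn), which is
# what default caching can reuse on a hit.
resp2 = chat.send_message(message='Now summarize section 2.')
print(resp2.usage_metadata.cached_content_token_count)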