Anyone get implicit caching results with their prompts yet? I still haven’t seen the cached token count in my usage metadata change from None. I had a 54,000-token static prompt and sent the same message several times, and not a single token was cached.
I dropped the prompt down to 11,000 tokens and continued sending the same message over and over. Still None.
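For reference, here is a minimal sketch of the repro I'm describing, assuming the `google-genai` Python SDK and an API key in a `GEMINI_API_KEY` environment variable (the model name and prompt sizes are placeholders). It sends the same static-prefix prompt repeatedly and prints `usage_metadata.cached_content_token_count` each time, which in my case never leaves None:

```python
import os

def cache_summary(prompt_tokens, cached_tokens):
    """Summarize one response: cached_content_token_count is None/0 on a cache miss."""
    cached = cached_tokens or 0
    return {"prompt": prompt_tokens, "cached": cached, "hit": cached > 0}

# Live repro, only runs when an API key is configured.
if os.environ.get("GEMINI_API_KEY"):
    from google import genai

    client = genai.Client()  # reads GEMINI_API_KEY from the environment
    static_prefix = "reference document text... " * 8000  # stand-in for my large static prompt

    for i in range(3):
        resp = client.models.generate_content(
            model="gemini-2.5-flash",  # assumed model name
            contents=static_prefix + "\nSame trailing question every time.",
        )
        u = resp.usage_metadata
        print(i, cache_summary(u.prompt_token_count, u.cached_content_token_count))
```

With implicit caching working as documented, I'd expect the second and third iterations to report a nonzero `cached` count covering most of the static prefix.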
Hi @roteck, thanks for reporting this issue. While reproducing it, we observed the same behavior. We will bring this to the engineering team's attention. Thank you.
You’re welcome, I was very excited to learn that implicit caching came to Gemini.
If it helps, I’ve managed to get implicit token caching to happen mostly reliably. The only way I’ve had success is to send message after message in rapid succession; then it almost always caches from the 2nd message onward. But if I hesitate for even 10 seconds or so, it doesn’t cache. Very short TTL? This makes implicit caching ineffective for my applications.
Since then I’ve been learning about explicit caching and have switched to it for now, until the new implicit system works reliably.
If you need any info from me or want me to try any techniques, let me know.