cachedContentTokenCount only showing explicit cached tokens

Eric_Tom_Mathews · August 23, 2025, 5:23pm

Hey,

I am seeking clarity on the interaction between explicit and implicit caching in the Gemini API, specifically regarding the cachedContentTokenCount field in usage metadata.

In my tests:

When explicit coaching (context cache) is enabled, the cachedContentTokenCount reports only the explicitly cached tokens.

When explicit coaching is disabled, the field reflects tokens hit via implicit caching, as expected.

However, this means I can’t see implicit cache benefits when explicit coaching is turned on, even for repeated prompt prefixes. Is this the intended behavior, or is it a bug or limitation of the API? Should implicit caching be combined with explicit caching (e.g., on requests mixing prefix cache and user content)?

I’d appreciate clarification or a pointer to relevant documentation. Has anyone else seen this or have official guidance for the expected API behavior in this scenario?

Topic		Replies	Views
Implicit Caching: Gemini 2.5 Pro Preview 05-06 Gemini API context_caching , gemini_25_pro	3	445	June 25, 2025
Implicit Caching Not Working for Gemini-2.5-Pro with 30k+ Tokens Despite Documentation Requirements Gemini API api , prompt	2	236	September 3, 2025
Gemini 2.5 Flash Lite: Implicit Caching Not Working Despite Meeting Documented Requirements Gemini API bug , gemini	1	334	March 4, 2026
Has anyone gotten implicit caching to work? Gemini API gemini-3	2	81	May 5, 2026
Implicit Context Caching stops working when a thinking budget is set – metadata.cached_token becomes None Gemini API model , context_caching	1	142	July 22, 2025

cachedContentTokenCount only showing explicit cached tokens

Related topics