Gemini Explicit Context Caching (cached_content) drops system_instruction in livekit.plugins.google — How to inject dynamic session variables?

Harshita_Sukumar_Pat · June 9, 2026, 7:05am

We are working on optimizing costs for our voice agent utilizing gemini-2.5-flash via the livekit.plugins.google plugin. Our system prompts are large (5,000+ tokens), so we are leveraging Gemini’s Explicit Context Caching by passing a pre-warmed cached_content ID.

However, because this is an inbound voice bot, every single phone call contains dynamic runtime variables unique to that user session (e.g., customer_name, account_balance, loan_eligibility).

If we bake these variables into the static Cache ID, we cause massive cache-miss overhead and risk variable hallucination across callers. To bypass this, we tried passing the dynamic variables inside the agent initialization’s system_instruction field alongside the cached_content ID, expecting them to blend.

Instead, the plugin completely drops the system_instruction parameter, throwing this warning:

{
  "message": "dropping ['system_instruction'] from Gemini request because cached_content='projects/225719900046/locations/asia-south1/cachedContents/119712627008995328' is set; these fields must be baked into the CachedContent resource", 
  "level": "WARNING", 
  "name": "livekit.plugins.google"
}

Questions:

Is it a strict limitation of the Gemini API or the LiveKit integration that prevents passing runtime-appended system_instruction rules on top of an explicit cached_content resource?
What is the recommended LiveKit pattern to utilize explicit context caching for the static instruction layout while still declaring dynamic session metadata safely on a per-job basis?

Topic		Replies	Views
System instruction and implicit caching question Gemini API api , context_caching	4	275	March 5, 2026
Best practice for injecting dynamic, non-conversational context in Gemini prompts? Gemini API gemini-api , prompt	2	170	December 24, 2025
Gemini 2.5 Flash Live Implicit Context Caching Not Working / Feedback Gemini API models , gemini	4	302	December 22, 2025
Using cached contents with Agent Gemini API gemini-flash , context_caching	1	287	June 26, 2025
Live API with ephemeral token ignores the system_instruction Gemini API api , gemini	4	202	January 16, 2026

Gemini Explicit Context Caching (cached_content) drops system_instruction in livekit.plugins.google — How to inject dynamic session variables?

Questions:

Related topics