Context caching blocked on Paid Tier 1 — max_total_token_count=0

Tommy_Roldan · April 13, 2026, 3:18am

Hi Gemini API Team,

I’m hitting a provisioning issue where context caching is completely disabled for my project despite being on an active Paid Tier 1 account with billing confirmed.

The error:

400 INVALID_ARGUMENT: Cached content is too large. total_token_count=1328389, max_total_token_count=0
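For anyone triaging the same error, here is a minimal sketch that pulls the two counters out of the message so a zero limit can be told apart from content that is merely over a nonzero limit. It assumes the message keeps the exact `total_token_count=..., max_total_token_count=...` wording quoted above; the names `CacheLimitError` and `parse_cache_limit_error` are made up for illustration and are not part of any SDK.

```python
import re
from typing import NamedTuple, Optional


class CacheLimitError(NamedTuple):
    """Counters extracted from a 'Cached content is too large' message."""
    total_token_count: int
    max_total_token_count: int

    @property
    def limit_is_zero(self) -> bool:
        # A limit of 0 means no content of any size could fit under it,
        # which points at quota/provisioning rather than content size.
        return self.max_total_token_count == 0


# Assumption: the server message keeps this key=value layout.
_PATTERN = re.compile(
    r"\btotal_token_count=(\d+),\s*max_total_token_count=(\d+)"
)


def parse_cache_limit_error(message: str) -> Optional[CacheLimitError]:
    """Return the parsed counters, or None if the message does not match."""
    match = _PATTERN.search(message)
    if match is None:
        return None
    return CacheLimitError(int(match.group(1)), int(match.group(2)))


if __name__ == "__main__":
    msg = (
        "400 INVALID_ARGUMENT: Cached content is too large. "
        "total_token_count=1328389, max_total_token_count=0"
    )
    print(parse_cache_limit_error(msg))
```

Logging the parsed values next to the project and model name may make a report like this one easier to act on.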

Key details:

- Account is Paid Tier 1 with active billing
- max_total_token_count reports as 0 for all models
- Caching works for smaller content (1-2 chunks) but fails at 3+ chunks (~1.3M tokens)
- Model: gemini-2.5-pro
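To pin down the chunk count at which caching starts failing, a rough sketch like the following could help. `try_cache` is a hypothetical callable you would supply yourself (for example, a wrapper around your own cache-creation call that returns True on success and False on the 400 error); nothing here calls the Gemini API directly.

```python
from typing import Callable, Optional, Sequence


def first_failing_prefix(
    chunks: Sequence[str],
    try_cache: Callable[[Sequence[str]], bool],
) -> Optional[int]:
    """Return the smallest number of leading chunks for which try_cache
    fails, or None if every prefix succeeds.

    try_cache is a hypothetical, caller-supplied function: it should
    attempt to cache the given chunks and return True on success.
    """
    for n in range(1, len(chunks) + 1):
        if not try_cache(chunks[:n]):
            return n
    return None


if __name__ == "__main__":
    # Stand-in for a real cache attempt: succeeds for up to 2 chunks,
    # mirroring the behaviour described in the list above.
    def fake_try_cache(prefix: Sequence[str]) -> bool:
        return len(prefix) <= 2

    print(first_failing_prefix(["a", "b", "c", "d"], fake_try_cache))
```

Pairing the failing chunk count with the token total from the error message gives a concrete boundary to include in a report.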

This appears to be the same provisioning sync issue reported in other threads where the cache quota is hardcoded to 0 despite an active paid account.

Can someone check why the cache quota is set to 0 for this project and force a resync?

Thank you

PLAYi · April 15, 2026, 4:56pm

Same. What gives? No response?

