Hi Gemini API Team,
I’m hitting what looks like a provisioning issue: the context caching limit for my project reports as 0, even though the project is on an active Paid Tier 1 account with billing confirmed.
The error:
400 INVALID_ARGUMENT: Cached content is too large. total_token_count=1328389, max_total_token_count=0
Key details:
- Account is Paid Tier 1 with active billing
- max_total_token_count reports as 0 for all models
- Caching works for smaller content (1-2 chunks) but fails at 3+ chunks (~1.3M tokens)
- Model: gemini-2.5-pro
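For anyone triaging the same error, here is a minimal sketch of how the zero-limit case could be told apart from a genuine size overrun. It only parses the error text quoted above and does not call the API; the helper name `classify_cache_error` and its return labels are hypothetical, and the message format is assumed to match the one in this post.

```python
import re

# Hypothetical helper: it only inspects the error string quoted in this post.
# The field names come from that message; everything else is an assumption.
_COUNTS = re.compile(r"total_token_count=(\d+),\s*max_total_token_count=(\d+)")


def classify_cache_error(message: str) -> str:
    """Label a 'Cached content is too large' message by its reported limit."""
    match = _COUNTS.search(message)
    if match is None:
        return "unrecognized"
    total, limit = int(match.group(1)), int(match.group(2))
    if limit == 0:
        # A limit of 0 rejects any content, so this is not a real size problem.
        return "zero_limit"
    if total > limit:
        return "over_limit"
    return "within_limit"


error = (
    "400 INVALID_ARGUMENT: Cached content is too large. "
    "total_token_count=1328389, max_total_token_count=0"
)
print(classify_cache_error(error))
```

With the message from this post the reported limit is 0, so the sketch labels it as a zero-limit case and not as content that exceeds a real quota.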
This appears to be the same provisioning sync issue reported in other threads where the cache quota is hardcoded to 0 despite an active paid account.
Can someone check why the cache quota is set to 0 for this project and force a resync?
Thank you