Has anyone here managed to get context caching working for gemini-2.0-flash-001 on a free tier account?

Hi there,

According to this pricing page, context caching is supported for gemini-2.0-flash-001 even on a free tier account:

I tried it, but I'm getting the following error:
google.genai.errors.ClientError: 429 RESOURCE_EXHAUSTED. {'error': {'code': 429, 'message': 'TotalCachedContentStorageTokensPerModelFreeTier limit exceeded for model gemini-2.0-flash: limit=0, requested=116251', 'status': 'RESOURCE_EXHAUSTED'}}
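For context, the call is roughly the standard explicit-caching pattern from the docs (a simplified sketch, not my exact code; the document path, display name, and TTL are placeholders):

```python
import os

from google import genai
from google.genai import types

# Assumed setup: GEMINI_API_KEY in the environment and a local text file to cache.
client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])

with open("large_document.txt") as f:  # placeholder path
    document_text = f.read()

# Create the explicit cache; this is the call that comes back with the 429 above.
cache = client.caches.create(
    model="gemini-2.0-flash-001",
    config=types.CreateCachedContentConfig(
        display_name="example-cache",  # placeholder name
        system_instruction="Answer questions using only the cached document.",
        contents=[document_text],
        ttl="3600s",  # keep the cache for one hour
    ),
)
print(cache.name)
```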

Please advise. Thank you.


Hi @limcheekin, you will see this error when you send too many requests per minute with the free tier Gemini API. Please try again after some time. Thank you.



@Kiran_Sai_Ramineni Here's the stat from an account with all 0's.
From the first request ever made with this account, I get the same error as above (only when using context caching).

Any suggestions?


Thanks for the quick response.

I faced this error from the first request too.

Thanks for sharing the screenshot.

Where could I find this screen?

Hi @Ayoub_Kazar @limcheekin, may I know the number of tokens you are trying to cache? Also, could you please confirm whether you are facing this error while creating the cache or while making a request against the cached content? Thank you.


I am facing this error when the app is creating the cache.

I think the number of tokens is stated in the error message above: 'requested=116251'.
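In case it helps, the same number can be confirmed up front by counting tokens on the text before trying to cache it (a rough sketch; the API key and file path are placeholders):

```python
from google import genai

client = genai.Client(api_key="YOUR_API_KEY")  # placeholder key

with open("large_document.txt") as f:  # placeholder path
    document_text = f.read()

# Count the tokens of the text we intend to cache.
result = client.models.count_tokens(
    model="gemini-2.0-flash-001",
    contents=document_text,
)
print(result.total_tokens)  # in my case this matches 'requested=116251' from the error
```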

Hope this clarifies.

Thank you.

Hi, I tried to create a cache for gemini-2.0-flash, but I always receive HTTP 503. Are you running into the same issue? Thank you!

This happens when creating the cache.
Here's the number of tokens: 298263

429 RESOURCE_EXHAUSTED. {'error': {'code': 429, 'message': 'TotalCachedContentStorageTokensPerModelFreeTier limit exceeded for model gemini-2.0-flash: limit=0, requested=298263', 'status': 'RESOURCE_EXHAUSTED'}}

No, recheck the docs on how to create a cache, then maybe you'll run into our issue too lol


@Long_Peng @Kiran_Sai_Ramineni Please check out the following code I used to create the cache: TelegramGPT/gemini.py at main · limcheekin/TelegramGPT · GitHub. The same code works for gemini-1.5-flash-002.
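For anyone who doesn't want to open the repo, it follows the standard explicit-caching flow (a condensed sketch, not verbatim from the linked file; the API key, document text, and question are placeholders): create the cache once, then pass its name on each generate_content call.

```python
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")  # placeholder key

# Create the cache once; this succeeds for gemini-1.5-flash-002 but
# returns the 429 above for gemini-2.0-flash-001 on the free tier.
cache = client.caches.create(
    model="gemini-2.0-flash-001",
    config=types.CreateCachedContentConfig(
        contents=["<long document text here>"],  # placeholder content
        ttl="3600s",
    ),
)

# Afterwards, reference the cache by name on every generate_content call.
response = client.models.generate_content(
    model="gemini-2.0-flash-001",
    contents="What does the document say about X?",  # placeholder question
    config=types.GenerateContentConfig(cached_content=cache.name),
)
print(response.text)
```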

Hope this helps to resolve the issue faster. Thanks.


Got the same issue.

My account is on the free tier.

Hi @Long_Peng, @Ayoub_Kazar, @limcheekin, while reproducing this I have also received the same error while using a free tier account. I will escalate this issue to the engineering team. Thank you.


Great! Please keep us posted once the issue has been resolved.

Thanks.

@Kiran_Sai_Ramineni May I know if there is any update on this issue?

Hello @Kiran_Sai_Ramineni,
do we have a workaround or solution for this issue?

I am currently encountering this issue, and I have a deadline coming up for which I wanted to use the 2.0 Flash model.

Or if anyone has managed to get explicit context caching working, please let me know.
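If it helps anyone else on a deadline, the stopgap I'm considering is to skip the explicit cache and send the document inline whenever cache creation is rejected; it costs more input tokens per request but keeps the app running (a rough sketch of my own fallback idea, not an official fix; the API key, model name, and file path are placeholders):

```python
from google import genai
from google.genai import errors, types

client = genai.Client(api_key="YOUR_API_KEY")  # placeholder key
MODEL = "gemini-2.0-flash-001"

with open("large_document.txt") as f:  # placeholder path
    document_text = f.read()

# Try to create the explicit cache once at startup; fall back to None if
# the free tier rejects it with 429 RESOURCE_EXHAUSTED.
try:
    cache = client.caches.create(
        model=MODEL,
        config=types.CreateCachedContentConfig(contents=[document_text], ttl="3600s"),
    )
except errors.ClientError:
    cache = None

def ask(question: str) -> str:
    if cache is not None:
        # Cached path: only the question counts as new input tokens.
        contents = question
        config = types.GenerateContentConfig(cached_content=cache.name)
    else:
        # Fallback: resend the whole document inline on every request.
        contents = [document_text, question]
        config = None
    response = client.models.generate_content(model=MODEL, contents=contents, config=config)
    return response.text
```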