Gemini Live Caching

Is audio context billed at the same $2.10 per million tokens? Is audio context caching planned? Thanks

@IKGN, audio is processed at 32 tokens/sec, and the charge is based on the token cost of the model you are using.

As for caching, you can cache the audio; please check the reference here: Context caching  |  Gemini API  |  Google AI for Developers

Thanks, but I’m asking about Gemini Live, specifically the internal audio context of that model. The complete cumulative context seems to be billed on each conversation turn, not just the new audio I send during that turn. I don’t think I can currently cache this audio?

Hey @IKGN - for billing purposes in the Live API, all tokens are counted at every turn, including new tokens from the latest prompt and tokens from the previous context.

I’ll file this (the ability to cache tokens and only be billed for new tokens) with the team as a FR.


Thank you very much! It makes a huge price difference: the quadratically increasing audio-context cost dominates all other costs. My app needs long conversations, so if the context could be cached, it would help me a lot!

This would be super helpful. Without this, live audio is basically unusable for a lot of use cases. A single minute of live audio can consume 20k tokens or even more, depending on the prompt etc. Google’s pricing suggests that 1 minute should cost about 1,920 tokens, but in reality token consumption explodes, because every single turn reprocesses all previous audio plus the instructions and tools (I think).
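To make the gap concrete, here is a rough back-of-the-envelope sketch. The 32 tokens/sec audio rate comes from the reply above; the turn length and turn count are made-up assumptions, and the billing model (full context re-billed every turn) is the behavior described in this thread, not official documentation:

```python
# Rough sketch of cumulative Live API audio billing.
# Assumption (per this thread): the full accumulated context is
# re-counted and billed on every conversation turn.

AUDIO_TOKENS_PER_SEC = 32   # stated audio tokenization rate
TURN_SECONDS = 10           # hypothetical: one 10-second user turn
TURNS = 36                  # hypothetical: 6 minutes of conversation

# New audio tokens added per turn
new_per_turn = AUDIO_TOKENS_PER_SEC * TURN_SECONDS  # 320 tokens

# If only new audio were billed, totals would grow linearly
# (this matches the ~1,920 tokens/min figure from pricing):
linear_total = new_per_turn * TURNS

# If the whole accumulated context is billed each turn, the total is
# new_per_turn * (1 + 2 + ... + TURNS), i.e. quadratic in length:
cumulative_total = sum(new_per_turn * t for t in range(1, TURNS + 1))

print(linear_total)      # 11520 tokens (~1,920 tokens per minute)
print(cumulative_total)  # 213120 tokens (~18.5x more)
```

Under these assumptions, a 6-minute conversation is billed for roughly 18x more audio tokens than the per-minute pricing alone would suggest, which is why caching the context would matter so much for long sessions.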
