Models/gemini-2.0-flash Video Understanding Model

Sohaib_Sajid · June 30, 2025, 9:01pm

When you make a call to Gemini’s new video understanding model, we need to pass in the URL every time. Does that mean the model is analyzing the video each time when we pass in a different prompt or want to talk about the video? If that is the case, it would require a lot of tokens and result in a very high cost. Does anyone have any experience working with this model?

GUNAND_MAYANGLAMBAM · July 1, 2025, 6:06am

Hi @Sohaib_Sajid , Welcome to the forum.

Gemini API is stateless, so it processes the input fresh with every request. To reduce costs, you can utilize context caching to avoid reprocessing the same data repeatedly. There is also a cookbook available you can refer to.

Thanks

Topic		Replies	Views
Gemini 2.0 - Video understanding Gemini API models , help_request	3	1972	May 2, 2025
Gemini 2.0 Flash Video Undestanding Issues Gemini API models , gemini-flash , gemini-20	2	277	June 19, 2025
Does Video Understading Clipping Feature Saves Usage? Gemini API api , gemini , video	1	32	January 13, 2026
Repeat video understanding context - best way to context cache? Gemini API api	2	34	February 2, 2026
Pricing mechanism Gemini API	4	212	October 10, 2024

Models/gemini-2.0-flash Video Understanding Model

Related topics