Gemini 2.0 - Video understanding

How are videos encoded when inputting them into Gemini models? What are some tips for achieving the best results in video understanding?

  1. Does the video resolution matter?
  2. Does the frame rate of the video matter?
  3. How are the videos encoded and fed into the model? Does it encode all frames or skip frames in the middle?
  4. Is there a specific version of Gemini that works best for videos?

Hello and welcome to the community.

While I’m unable to answer all your questions directly, please refer to the Video understanding notebook. For instance, you’ll see that video understanding works well with the Gemini 2.0 Flash model, and the notebook makes it easy to test other models and compare their performance. Please try it out and let us know if you still have questions.

Here’s what you need to know:

  1. Higher resolution provides more detail but also requires more resources. You’ll need to balance the resolution based on your task and computational limits.

  2. Frame rate matters because more frames provide more temporal detail, but higher rates can be computationally expensive. Lower frame rates (e.g., 1 frame per second) work well when high temporal resolution isn’t necessary.

  3. Videos are typically encoded by sampling frames at a fixed interval (e.g., 1 frame per second); intermediate frames are skipped to reduce the data load while preserving the information most tasks need. This means fast events that happen entirely between sampled frames may not be captured.

  4. Recent multimodal versions such as Gemini 2.0 Flash and Gemini 1.5 Pro handle video well across different modalities. Gemini 2.0 Flash is a good starting point, and it’s worth comparing a few models on your own footage, since performance can vary by task.
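To make point 3 concrete, here is a minimal sketch (plain Python; `sampled_indices` is a hypothetical helper, not part of any SDK) of which frame indices survive when a clip is downsampled to a target rate such as 1 frame per second:

```python
def sampled_indices(total_frames: int, source_fps: float,
                    target_fps: float = 1.0) -> list[int]:
    """Indices of the frames kept when downsampling a video to target_fps."""
    step = source_fps / target_fps      # keep one frame every `step` source frames
    count = int(total_frames / step)    # how many frames survive the downsampling
    return [round(i * step) for i in range(count)]

# A 10-second clip at 30 fps keeps 10 frames, one per second:
print(sampled_indices(300, 30))  # [0, 30, 60, 90, 120, 150, 180, 210, 240, 270]
```

Everything between the kept indices is dropped, which is why a lower sampling rate is cheap but can miss very brief events.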

By adjusting these factors—resolution, frame rate, encoding, and choosing the right version—you can optimize the performance of your Gemini model for video understanding.
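Putting it together, here is a minimal sketch of sending a video to the model with the `google-genai` Python SDK. The file name, API key, and prompt are placeholders, and depending on file size the upload may need a short wait while the Files API finishes processing before the file is usable:

```python
from google import genai

# Assumption: you have an API key from Google AI Studio.
client = genai.Client(api_key="YOUR_API_KEY")

# Upload the video via the Files API (placeholder path).
video = client.files.upload(file="my_clip.mp4")

# Ask a question about the video.
response = client.models.generate_content(
    model="gemini-2.0-flash",
    contents=[video, "Describe what happens in this video."],
)
print(response.text)
```

The same pattern works with other model names, so you can swap the `model` string to compare versions on the same clip.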
