Video misclassification with Gemini 2.0, why?

nikhilkuppa · May 12, 2025, 2:16pm

Hi!

I’m using Gemini 2.0 flash for a multilabel video classification task, with a very detailed prompt to define the set of categories the model can classify the video as. I’ve found the results to be fairly aligned with my prompt definition at a temperature of 0.4, however there was not enough consistency (while most categories had exactly the type of videos I wanted in them, at scale most videos were also being misclassified).

To address this issue, I tried both temperatures of 0 and 0.2 to increase consistency in the results, but I now find that they barely align with my prompt. The classifications still make sense, but they’re not exactly following the explicit steps i assign for the task. What could be the reason for this, and how could I approach this problem to get more consistent results across similar videos?

I’d appreciate any and all technical advise on this issue, thank you!

Akhilesh_Kambhampati · May 12, 2025, 5:37pm

@nikhilkuppa,

The “temp” and “top_P” only effects the decoder part of the transformer. something i would try is the split the task into two,

step 1: Describe the video in details lets say under 500 words.(depending on the length of the video )
step 2: Now use the above generated description to classify the video

this way you might be able to understand and tweek the temp and top-P of the first step to get the required info or keywords from the description that will help you with classification.

Topic		Replies	Views
Gemini 2.0 Flash Video Undestanding Issues Gemini API models , gemini-flash , gemini-20	2	262	June 19, 2025
Gemini Flash(thinking)/Pro end with meaningless character when temperature is 0 Gemini API bug , gemini	1	137	June 4, 2025
Wildly inaccurate results after fine-tuning Gemini API fine-tuning , classification	3	203	October 26, 2024
Did anything change with gemini-2.0-flash yesterday? Gemini API models , gemini-20	7	204	June 25, 2025
Video Understanding response cut off at token ~= 2k Gemini API bug , api , video	3	142	June 25, 2025

Video misclassification with Gemini 2.0, why?

Related topics