So I made an App to generate lyrics to feed to Suno. Suno then makes a sing as MP3 and MP4, and this topic is purely related to Gemini’s ability to analyze an MP4 file!
So, I used the “Deep Research” option in Gemini to analyze the lyrics I’ve generated for this song. And POOF, I get a nice report detailing the strong and weak points in it. Nice!
But I also want “Deep Research” to analyze the sound file (MP4), and it tells me it cannot do this. #Seriously? Anyways, this is just a minor annoyance…
However, if I just use Gemini 2.5 Flash without the “Deep Research” with just the MP4 file, I receive a nice analysis including the lyrics. So Gemini can analyze MP4 files! It even gives an opinion about the song, like “The song “Wagenburg Elegie” is a highly effective and compelling piece of dramatic storytelling and music.” and it also noticed that it was created by SUNO, an AI music engine. And it is interesting as I did not give Gemini the lyrics itself, so it extracted them from the MP4 in some way.
But if Gemini can analyze an MP4 file, then why can’t “Deep Research” do the same? It’s the same AI engine, isn’t it?