Frustrated, audio analysis was working great, now it's not. Help!

So, as you can see in the screenshots, I was able to have Gemini very accurately analyze several .mp3 files: music, one in Arabic, and the last garbled nonsense. It was able to “hear” and answer my prompts about each file. As of a few hours ago, it won't work with any uploaded music file. It keeps saying it's a text-only model, and when I ask why it could analyze those files a few hours ago, it says it was “role playing” and just got into it. It just talks in loops. I've tried this in the same prompt as the earlier music classification and in a new prompt. It's frustrating, as this was the basis of an app I'm working on. Anyone got any input? It's funny, because ChatGPT could do this last summer: it could name a song, talk about it, anything. Then one day it said exactly this, that it's a text-only model. I'm so frustrated! @GoogleLLC

Welcome aboard! I tried one of my standard test cases, the preamble to the US Constitution, and today Gemini 1.5 truncated the transcription to:

We the people of the United States in order to form a more perfect union establish justice ensure domestic tranquility provide for the common defense P

It used to reliably generate the entire preamble, which is well recorded in the audio file.

Another audio file that used to work was rejected with a RECITATION block.

Updating the status of this issue. The preamble test case was resolved.
This audio file used to work reliably before the May 14 model update: Arthur the Rat – Dictionary of American Regional English – UW–Madison (actual audio file: Audio file in mp3 format).

Since the update, it has reliably generated a RECITATION block. The audio file is useful for evaluating the model, since a known-good transcript is also available: Reference transcript for audio file.
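For anyone who wants to put a number on "used to work" versus "truncated", here is a minimal sketch of scoring a model transcript against a known-good reference using word error rate (word-level edit distance divided by reference length). The function names `normalize` and `word_error_rate` are my own for illustration, and the normalization (lowercasing, dropping punctuation) is an assumption you may want to adjust.

```python
import re


def normalize(text):
    """Lowercase the text and split it into words, dropping punctuation."""
    return re.findall(r"[a-z0-9']+", text.lower())


def word_error_rate(reference, hypothesis):
    """Word-level edit distance between the transcripts, divided by the
    number of words in the reference. 0.0 means a perfect match."""
    ref = normalize(reference)
    hyp = normalize(hypothesis)
    if not ref:
        raise ValueError("reference transcript is empty")

    # Classic dynamic-programming edit distance, one row at a time.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from output
            insertion = curr[j - 1] + 1  # extra word in the output
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)
```

A truncated transcript shows up as a high rate because every missing word counts as a deletion, so running this on the same file before and after a model update gives a quick regression check.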


The screen image shows a red triangle next to the Model response.

If you clicked on that - what did it say?


It said “Sexually Explicit” language.
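If you are calling the API rather than using the web UI, the same information the warning icon shows can usually be read from the response itself. Below is a rough sketch that works on the response as a plain dict. It assumes the REST JSON field names `promptFeedback.blockReason`, `candidates[].finishReason`, and `candidates[].safetyRatings[]` with `category`, `probability`, and `blocked`; please check those against the current API reference. The function name `explain_block` and the sample payload are made up for illustration.

```python
def explain_block(response):
    """Collect human-readable reasons why a response was blocked or cut off.

    `response` is assumed to be the parsed JSON body of a generateContent
    call; missing fields are simply skipped.
    """
    reasons = []

    feedback = response.get("promptFeedback", {})
    if feedback.get("blockReason"):
        reasons.append(f"prompt blocked: {feedback['blockReason']}")

    for index, candidate in enumerate(response.get("candidates", [])):
        finish = candidate.get("finishReason")
        # "STOP" is assumed to mean normal completion; anything else
        # (for example RECITATION or SAFETY) is worth reporting.
        if finish and finish != "STOP":
            reasons.append(f"candidate {index} stopped: {finish}")

        for rating in candidate.get("safetyRatings", []):
            flagged = rating.get("blocked") or rating.get("probability") in ("MEDIUM", "HIGH")
            if flagged:
                reasons.append(
                    f"candidate {index} flagged {rating.get('category')} "
                    f"({rating.get('probability')})"
                )
    return reasons


# Made-up payload, shaped like the assumed response format.
sample = {
    "candidates": [
        {
            "finishReason": "SAFETY",
            "safetyRatings": [
                {"category": "HARM_CATEGORY_SEXUALLY_EXPLICIT", "probability": "HIGH"},
                {"category": "HARM_CATEGORY_HARASSMENT", "probability": "NEGLIGIBLE"},
            ],
        }
    ]
}
for line in explain_block(sample):
    print(line)
```

Logging these reasons next to each test file makes it easier to tell a RECITATION block apart from a safety flag when reporting a regression.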
