Audio file doesn't load correctly

Matteo_Ferrario · December 20, 2024, 8:37pm

I am using Gemini 1.5 Flash from a Google Cloud function (running in the US), and it cannot read an audio file that I upload, although the same code works locally and returns the correct output. On GC, I get the message that the file has been uploaded, but then Gemini answers as if it didn’t have it. One hypothesis is that this happens because my Gemini account is based in Italy while the GC functions run in the US. Another is that the file may be corrupt.
Here are the uploaded file’s details, as printed in the GC terminal:

Audio file uploaded: genai.File({
    'name': 'files/fxjmrf7j67we',
    'display_name': 'audio_file',
    'mime_type': 'audio/mpeg',
    'sha256_hash': 'OTMyZjllZmU2NjJiMzNlNTUxYjEyZDI3Nzc2Y2NjYzk2OGI5N2EzN2JhYTYzZjQ5NTM5ZWY1YWI5MWY4MzM1NQ==',
    'size_bytes': '13293',
    'state': 'ACTIVE',
    'uri': 'https://generativelanguage.googleapis.com/v1beta/files/fxjmrf7j67we',
    'create_time': '2024-12-20T20:24:07.131807Z',
    'expiration_time': '2024-12-22T20:24:07.124177437Z',
    'update_time': '2024-12-20T20:24:07.131807Z'})

If I try to open the file’s URI in a browser, I get this (but I think that’s normal):

{
  "error": {
    "code": 403,
    "message": "Method doesn't allow unregistered callers (callers without established identity). Please use API Key or other form of API consumer identity to call this API.",
    "status": "PERMISSION_DENIED"
  }
}

And here’s the function:

import google.generativeai as genai

# GEMINI_API_KEY is defined elsewhere in my code.
def summary(audio_path):
    genai.configure(api_key=GEMINI_API_KEY)
    
    try:
        audioFile = genai.upload_file(audio_path, display_name="audio_file")
        print(f"Audio file uploaded: {audioFile}")
    except Exception as e:
        print(f"Error uploading audio file: {e}")
        # Stop here: without an uploaded file, audioFile is undefined below.
        return None

    generation_config = {
        "temperature": 1,
        "top_p": 0.95,
        "top_k": 40,
        "max_output_tokens": 8192,
        "response_mime_type": "text/plain",
    }

    model = genai.GenerativeModel(model_name='gemini-1.5-flash', generation_config=generation_config)

    prompt = """Summarise this audio file."""

    try:
        response = model.generate_content([prompt, audioFile])
        response_text = response.text
        print(f"Model response: {response_text}")
    except Exception as e:
        print(f"Error generating content: {e}")
        return None

    return response_text
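
One way to test the "file may be corrupt" hypothesis is to compare the local copy with what the Cloud run actually uploaded. This is only a sketch using the standard library. It assumes that the printed 'sha256_hash' string is the Base64 encoding of the ASCII hex digest, which is what decoding the value above suggests. The helper names local_sha256_hex and matches_reported_hash are made up for illustration.

```python
import base64
import hashlib


def local_sha256_hex(path: str) -> str:
    """Return the SHA-256 hex digest of the file at `path`."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read in chunks so large audio files are not loaded at once.
        for chunk in iter(lambda: fh.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_reported_hash(path: str, reported_b64: str) -> bool:
    """Compare a local file with the 'sha256_hash' string printed by the SDK.

    Assumption: the printed value is Base64 over the ASCII hex digest.
    """
    reported_hex = base64.b64decode(reported_b64).decode("ascii")
    return local_sha256_hex(path) == reported_hex.lower()
```

Running matches_reported_hash on the local machine with the hash printed on GC shows whether both environments uploaded the same bytes. Comparing os.path.getsize(audio_path) with the reported 'size_bytes' (13293 here) is a cheaper first check. If either differs, the file reaching the Cloud function is probably not the one that works locally.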

Shrushti_Patil · July 18, 2025, 7:06am

Hi @Matteo_Ferrario ,
This issue may be due to a region mismatch or to file access timing. Make sure the file URI is still valid when you call Gemini, and try specifying the project and location explicitly in the API call. Also check that the audio format is one of the MIME types supported by Gemini.
Reference: Audio understanding | Gemini API | Google AI for Developers
Thanks!
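
The "still valid" check can be sketched against the metadata fields printed in the question ('state', 'expiration_time', 'mime_type'). This is a rough illustration, not an official API: check_file_metadata and parse_utc_timestamp are hypothetical helpers, timestamps are assumed to end in "Z" (UTC), and allowed_mime_types is left for the caller to fill in from the audio documentation.

```python
from datetime import datetime, timezone


def parse_utc_timestamp(ts: str) -> datetime:
    """Parse e.g. '2024-12-22T20:24:07.124177437Z' (assumed to be UTC)."""
    base, _, frac = ts.rstrip("Z").partition(".")
    # datetime holds microseconds only, so keep at most six fractional digits.
    micro = int((frac + "000000")[:6])
    parsed = datetime.strptime(base, "%Y-%m-%dT%H:%M:%S")
    return parsed.replace(microsecond=micro, tzinfo=timezone.utc)


def check_file_metadata(meta, now=None, allowed_mime_types=None):
    """Return a list of human-readable problems; an empty list means none found."""
    now = now or datetime.now(timezone.utc)
    problems = []
    if meta.get("state") != "ACTIVE":
        problems.append(f"file state is {meta.get('state')!r}, not 'ACTIVE'")
    if parse_utc_timestamp(meta["expiration_time"]) <= now:
        problems.append("file has expired and must be uploaded again")
    # Only check the MIME type if the caller supplied a list to check against.
    if allowed_mime_types is not None and meta.get("mime_type") not in allowed_mime_types:
        problems.append(f"unexpected MIME type {meta.get('mime_type')!r}")
    return problems
```

Calling it right before generate_content, with the values from the uploaded file object, would show whether the file had expired or was not yet ACTIVE at request time.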
