More audio file type support in (openai-compatible) api?

Sirui_Lu · April 3, 2025, 3:22pm

i am trying to use audio in the openai compatible api:
however i get:
openai.BadRequestError: Error code: 400 - [{‘error’: {‘code’: 400, ‘message’: ‘Invalid audio format “m4a” for audio generation. Valid formats are: [wav, mp3]’, ‘status’: ‘INVALID_ARGUMENT’}}]

although this says more format are supported:

Supported audio formats

Gemini supports the following audio format MIME types:

WAV - audio/wav
MP3 - audio/mp3
AIFF - audio/aiff
AAC - audio/aac
OGG Vorbis - audio/ogg
FLAC - audio/flac

and this (Audio understanding (speech only) | Generative AI | Google Cloud) says even more format are supported:

Audio MIME type	Gemini 2.0 Flash	Gemini 2.0 Flash-Lite
AAC - audio/aac
FLAC - audio/flac
MP3 - audio/mp3
MPA - audio/m4a
MPEG - audio/mpeg
MPGA - audio/mpga
MP4 - audio/mp4
OPUS - audio/opus
PCM - audio/pcm
WAV - audio/wav
WEBM - audio/webm

sorry for the quick typing. time constrained. but thanks in advance!

Sirui_Lu · April 3, 2025, 3:41pm

what puzzled me and prompted me to try was that this (m4a) works at ai studio…

jkirstaetter · April 3, 2025, 6:16pm

Hi,

The OpenAI comp runs against the Google AI Gemini API and there I do not see the m4a format as an accepted option either.

Cheers

Sirui_Lu · April 3, 2025, 7:40pm

is this an excuse for not accepting m4a? Aren’t we passing the base64 encoded version and a string for media_type?

Krish_Varnakavi1 · June 13, 2025, 5:09am

Hi @Sirui_Lu ,

Are you still facing this issue?

Topic		Replies	Views
Gemini 1.5 refuses to process audio files Gemini API gemini-15 , api , web-ml	8	490	September 19, 2024
Gemini 2.5 Flash doesn't have audio processing capability, but why? Gemini API ui , gemini-flash-2-5	3	207	June 4, 2025
Gemini flash 1.5 8B having an error with not generating content in audio file Gemini API models , audio	2	67	May 14, 2025
Python SDK generate_content_async error Gemini API api	2	120	October 14, 2024
Unable to select Audio and Video Google AI Studio	3	217	July 17, 2024

More audio file type support in (openai-compatible) api?

Supported audio formats

Related topics