Does Gemini downsample audio files to a 16 Kbps bitrate?

mihail_6578 · September 26, 2025, 3:17pm

I want to use gemini-2-5-flash to transcribe audio files.
My files are sampled at 8 kHz with a 16 kbps bitrate. Before sending them to the model, I preprocess the audio (keep only the left channel and remove the silent parts). After preprocessing I need to save the files, so I have some questions about what format to save them in.
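As an illustration of the left-channel step only, here is a minimal sketch using the Python standard library. It assumes the audio has already been decoded to 16-bit PCM WAV (a 16 kbps source is a compressed format and would need decoding first); the function name `extract_left_channel` and the file paths are hypothetical, and silence removal is not shown.

```python
import array
import sys
import wave


def extract_left_channel(src_path: str, dst_path: str) -> None:
    """Write the first (left) channel of a 16-bit PCM WAV file to a mono WAV."""
    with wave.open(src_path, "rb") as src:
        n_channels = src.getnchannels()
        sample_width = src.getsampwidth()
        frame_rate = src.getframerate()
        frames = src.readframes(src.getnframes())

    # Assumption: 16-bit samples; other widths need different handling.
    if sample_width != 2:
        raise ValueError("this sketch only handles 16-bit PCM")

    samples = array.array("h")
    samples.frombytes(frames)
    # WAV data is little-endian; swap on big-endian hosts.
    if sys.byteorder == "big":
        samples.byteswap()

    # Samples are interleaved (L R L R ...), so take every n-th one.
    left = samples[0::n_channels]

    if sys.byteorder == "big":
        left.byteswap()
    with wave.open(dst_path, "wb") as dst:
        dst.setnchannels(1)
        dst.setsampwidth(2)
        dst.setframerate(frame_rate)  # sample rate is left unchanged
        dst.writeframes(left.tobytes())
```

The output keeps the original sample rate, so any resampling decision stays separate from the channel split.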
My questions are:

Should I resample the audio from 8 kHz to 16 kHz? Does it make any difference to Gemini?
What bitrate should I use? The documentation states that Gemini downsamples audio files to a 16 Kbps data resolution. Does that mean it downsamples every input to a 16 kbps bitrate, that it resamples to a 16 kHz sample rate, or that it refers to a bit depth of 16-bit PCM?

I find it hard to believe that it downsamples every input to a 16 kbps bitrate.
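To make the three readings concrete, here is a small sketch of how sample rate, bit depth, and bitrate relate for uncompressed PCM. The helper name `pcm_bitrate_kbps` is hypothetical, and the numbers say nothing about what Gemini does internally; they only show that 16 kbps is far below the raw PCM rate at either 8 kHz or 16 kHz.

```python
def pcm_bitrate_kbps(sample_rate_hz: int, bit_depth: int, channels: int = 1) -> float:
    """Bitrate of uncompressed PCM audio in kilobits per second."""
    return sample_rate_hz * bit_depth * channels / 1000


# Mono, 16-bit PCM at the two sample rates in question.
print(pcm_bitrate_kbps(8_000, 16))   # 128.0
print(pcm_bitrate_kbps(16_000, 16))  # 256.0

# A 16 kbps file at 8 kHz is therefore roughly 8x smaller than raw 16-bit PCM,
# which implies a compressed codec rather than plain PCM.
print(pcm_bitrate_kbps(8_000, 16) / 16)  # 8.0
```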

Thanks
