Audio input in other languages than English: It works, but can I rely on this?

EirikW · November 22, 2024, 8:30am

In the documentation for audio, it states that “Gemini can only infer responses to English-language speech.” However, when I try this with Norwegian speech (even in a dialect) it works nicely! Is there a caveat or limitation I should be aware of, before relying too much on this capability?

Norwegian is a supported language, as listed here. So I am a bit confused about this conflicting documentation.

My use case (that is currently working) is that I upload audio in Norwegian, have a system prompt in Norwegian, and a prompt in Norwegian, asking Gemini AI (1.5) to transcribe and summarize the audio.

Shrushti_Patil · August 6, 2025, 10:36am

Hi @EirikW ,
Yes, Gemini 1.5 supports multilingual audio inputs.
Thanks!

Topic		Replies	Views
Using Gemini 2.0 As an STT agent Gemini API gemini-20	2	546	June 19, 2025
New Gemini Live API "Native audio output" models not supporting System Instructions Gemini API api , models , live-streaming	4	176	June 10, 2025
Has Anyone Gained Access to Gemini 1.5 Pro API? (Re: Gemini 1.5 Pro API's Multimodal Features) Gemini API	3	202	April 26, 2024
How many languages that google AI supported? Gemini API learning	2	63	April 15, 2025
Feedback on Voice Chat Language Selection Gemini API feedback	1	126	February 10, 2025

Audio input in other languages than English: It works, but can I rely on this?

Related topics