How to bias input_audio_transcription with a prompt in the Gemini Live API?

bibitto · July 10, 2025, 9:34am

Situation

I want the ASR to recognize domain-specific terms in our meetings.

What I did: passed a prompt (custom vocabulary + context) via System Instruction.
Issue: those terms are still mistranscribed, so the prompt seems to be ignored.

Questions

Is there an official way to supply a prompt, vocabulary list, or “hints” that truly affects input_audio_transcription?
If not, what workaround does Google recommend?
Does the recognizer run before any prompt conditioning, making System Instructions ineffective for ASR?

Krish_Varnakavi1 · July 10, 2025, 6:12pm

Hi @bibitto,

Welcome to the Google AI Forum!

Gemini’s input_audio_transcription does not support prompt conditioning or biasing via System Instructions in a reliable way. The transcription (ASR) is run independently before any prompt or context is processed.

In order to implement this solution specific to your use-case, please try speech-to-text API first and feed the transcribed text into Gemini..

Topic		Replies	Views
Live API Hangs When Using System Prompt with Audio-Only Response Modality Gemini API audio , gemini-flash	1	134	June 19, 2025
Expert opinion on System Instruction Gemini API	2	302	May 22, 2024
Customize how Gemini pro 1.5 exp responds Google AI Studio models	6	212	August 26, 2024
Prompts for ASR task Gemini API gemini-15 , api	0	89	June 18, 2024
The System Instructions Fiasco with Gemini vs competitors Gemini API api , model	1	363	January 3, 2025

How to bias input_audio_transcription with a prompt in the Gemini Live API?

Related topics