Gemini Live API: print the transcripts

Tina_Jasmine · July 15, 2025, 5:30pm

I’m using this Gemini Live API tutorial now: cookbook/quickstarts/Get_started_LiveAPI.py at c063f0dbf13aa0da5f4b75931e174f9e02f16bce · google-gemini/cookbook · GitHub

Is there a way, maybe a flag to set, to also print the transcript of our speech?
AWS Nova Sonic shows that by default. I need something similar here too.

GUNAND_MAYANGLAMBAM · July 16, 2025, 4:14am

Hey @Tina_Jasmine, you can pass the input_audio_transcription parameter in the config to retrieve the input audio transcription.

config = {
    "response_modalities": ["TEXT"],
    "input_audio_transcription": {},
}

Please refer to the documentation for more details.

Thanks.

Tina_Jasmine · July 16, 2025, 4:34am

But when I add it, voice doesn't work anymore. How can I keep both?

GUNAND_MAYANGLAMBAM · July 17, 2025, 10:02am

Hey, I just checked on my end and it worked fine. You just need to update the receive_audio function:

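async def receive_audio(self):
    """Background task that reads from the websocket and writes PCM chunks to the output queue."""
    while True:
        turn = self.session.receive()
        async for response in turn:
            # Audio bytes from the model go to the playback queue.
            if data := response.data:
                self.audio_in_queue.put_nowait(data)
            elif text := response.text:
                print(text, end="")
            # server_content can be None on non-content messages, so guard before reading it.
            content = response.server_content
            if content is None:
                continue
            # Check transcripts on every message, and label which side is speaking.
            if content.output_transcription:
                print("Model transcript:", content.output_transcription.text)
            if content.input_transcription:
                print("User transcript:", content.input_transcription.text)

        # The turn has ended (or was interrupted): drop any audio still queued so playback stops.
        while not self.audio_in_queue.empty():
            self.audio_in_queue.get_nowait()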
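
On the "how can I keep both" question: the config posted earlier sets response_modalities to ["TEXT"], which is probably why voice output stopped. Below is a minimal sketch, assuming that audio replies plus both transcripts are wanted. The CONFIG name and the extract_transcripts helper are hypothetical (not part of the cookbook script), and the SimpleNamespace objects are only stand-ins shaped like the responses handled in receive_audio, so the snippet runs without a live session.

```python
from types import SimpleNamespace

# Assumed config: keep spoken replies and request transcripts for both sides.
CONFIG = {
    "response_modalities": ["AUDIO"],
    "input_audio_transcription": {},   # transcript of what the user says
    "output_audio_transcription": {},  # transcript of what the model says
}


def extract_transcripts(response):
    """Return (input_text, output_text) for one message; None where absent."""
    content = getattr(response, "server_content", None)
    if content is None:
        # Messages without server_content carry no transcripts.
        return None, None
    heard = getattr(content, "input_transcription", None)
    spoken = getattr(content, "output_transcription", None)
    return getattr(heard, "text", None), getattr(spoken, "text", None)


# Stand-in messages (hypothetical), mimicking the attribute layout used above.
user_msg = SimpleNamespace(
    server_content=SimpleNamespace(
        input_transcription=SimpleNamespace(text="hello"),
        output_transcription=None,
    )
)
empty_msg = SimpleNamespace(server_content=None)

print(extract_transcripts(user_msg))   # ('hello', None)
print(extract_transcripts(empty_msg))  # (None, None)
```

In the cookbook script, the same dictionary would replace the existing config that is passed when the session is opened, and the helper could be called inside the async for loop instead of reading server_content directly.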
