It gives you the transcript. You can find the solution here: Will it be possible to receive text and audio data in the multimodal API?
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Will it be possible to receive text and audio data in the multimodal API? | 13 | 964 | July 22, 2025 | |
| Why in Gemini Live API with Audio Modality its Transcription is not available in response | 5 | 258 | August 15, 2025 | |
| Gemini live api issue multimodal | 1 | 152 | October 10, 2025 | |
| outputAudioTranscription NOT WORKING WHEN [Modality.AUDIO] | 2 | 230 | June 19, 2025 | |
| Transcript on live audio not been passed back during conversation (ephemeral tokens auth) | 6 | 130 | October 13, 2025 |