It gives you the transcript. You can find the solution here: Will it be possible to receive text and audio data in the multimodal API?
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Will it be possible to receive text and audio data in the multimodal API? | 13 | 785 | July 22, 2025 | |
Why in Gemini Live API with Audio Modality its Transcription is not available in response | 4 | 123 | June 11, 2025 | |
outputAudioTranscription NOT WORKING WHEN [Modality.AUDIO] | 2 | 101 | June 19, 2025 | |
How to get text output from gemini-2.5-flash-preview-native-audio-dialog | 3 | 222 | July 10, 2025 | |
Retrieving transcribed audio input prompt with reply | 1 | 149 | July 24, 2025 |