Yeah it’s currently not working, but I found a work-around:
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Will it be possible to receive text and audio data in the multimodal API? | 13 | 986 | July 22, 2025 | |
| Why in Gemini Live API with Audio Modality its Transcription is not available in response | 5 | 270 | August 15, 2025 | |
| WebSocket Error 1007 When Requesting Simultaneous Audio + Text in Gemini Flash Models (AI voice Transcribing issue) | 0 | 57 | January 12, 2026 | |
| Gemini Live API not returning outputAudioTranscription event | 1 | 42 | February 11, 2026 | |
| Gemini-2.5-flash-native-audio-preview-09-2025 Text -> Text Only Not Working | 50 | 2230 | February 16, 2026 |