Gemini Live API models 'inputTranscription' hallucinations

Hi everyone,

We’re currently testing a custom Gemini Live model on Vertex AI, and we’ve noticed an issue where the model sometimes generates responses without receiving any actual input.

After some debugging, it looks like this behavior is caused by random input transcription detections that don’t come from the user. These unexpected transcriptions usually follow the same pattern:

  • Several short transcriptions with unrecognizable (essentially random) content,

  • Followed by one final transcription whose content is "None".

Our microservice backend explicitly handles turn-taking and input delivery to the Live API, and it logs every VAD detection to the terminal. When this random issue happens, NO VAD detection is logged, which suggests the problem originates on the Google API side rather than from real user input. We haven’t been able to find a clear cause, and we suspect it is related either to the API itself or to our model deployment setup.

Below is an example trace of one of these cases: you can see the setupComplete event (the first session setup) immediately followed by an unexpected inputTranscription event.

:magnifying_glass_tilted_left: Example log summary (system running in Spanish)

Below is a simplified extract from our logs that shows the issue:

  • The session starts normally:
    setupComplete (session established)
    Twilio WebSocket connected
    6 clients loaded, TTS ready
  • Immediately after setup, the API sends:
    inputTranscription: “या”
    inputTranscription: “, 2, 3, 4, 5, 6,”
    inputTranscription: " 7, 8 9, 10, 11,"
    inputTranscription: " 12"
    inputTranscription: None
  • The model then generates an unexpected response
    modelTurn: “Usted disculpe, no le he entendido muy bien. ¿Podría, por favor, repetir lo que dijo?” (“Excuse me, I didn’t understand you very well. Could you please repeat what you said?”)

Has anyone experienced something similar or knows what could be happening?
Any help or pointers from the Vertex AI / Gemini team would be greatly appreciated!

Thanks in advance :folded_hands: