Hi everyone,
I’m building a real-time Hebrew voicebot using the gemini-2.5-flash-native-audio-preview-09-2025 model, and I’m running into inconsistent behavior that I can’t fully explain.
The issue:
Sometimes the model simply doesn’t answer at all. The bot receives the audio input, but there’s no response from Gemini. After several attempts (sometimes 3–5 retries), it suddenly responds normally. Other times, the entire flow works perfectly from the first message, without any delays or failures.
What I’ve confirmed so far:
• The audio stream is being sent correctly
• The STT output and the request payload are both valid
• No errors are returned from the API
• The problem is intermittent and unpredictable
• When it works, it works flawlessly
What I’m trying to understand:
• Is this a known issue with the current preview model?
• Are there recommended settings, timeouts, or event-handling mechanisms to improve stability?
• Could this be related to rate limits, streaming configuration, or model warm-up behavior?
• Is there any diagnostic logging I should enable to better understand the silent failures?
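To make the silent failures measurable in the meantime, this is roughly the kind of watchdog I could wrap around each turn. It is only a sketch under assumptions: `send_turn` is a hypothetical placeholder for whatever coroutine sends the audio and awaits the first response, the logger name, attempt count, and timeout are arbitrary, and nothing here is part of the Gemini SDK.

```python
import asyncio
import logging
import time
from typing import Awaitable, Callable, Optional, TypeVar

T = TypeVar("T")
log = logging.getLogger("voicebot.watchdog")  # hypothetical logger name


async def first_response_with_retry(
    send_turn: Callable[[], Awaitable[T]],  # hypothetical: sends audio, awaits first reply
    attempts: int = 5,
    timeout_s: float = 4.0,
) -> Optional[T]:
    """Retry send_turn until it answers within timeout_s; log every silent attempt."""
    for attempt in range(1, attempts + 1):
        started = time.monotonic()
        try:
            # wait_for cancels the pending call once the timeout expires
            result = await asyncio.wait_for(send_turn(), timeout=timeout_s)
        except asyncio.TimeoutError:
            elapsed = time.monotonic() - started
            log.warning("attempt %d: no response after %.1fs", attempt, elapsed)
            continue
        elapsed = time.monotonic() - started
        log.info("attempt %d: response after %.2fs", attempt, elapsed)
        return result
    # Every attempt timed out
    return None
```

Logging the attempt number and elapsed time per turn should at least show whether the silent turns cluster at session start (which would hint at warm-up) or are spread evenly (which would point more toward rate limits or streaming configuration).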