Gemini TTS Multi-Speaker Mode: 7 Critical Bugs After 3 Weeks in Production (finishReason 'OTHER', Truncation, Voice Swapping, Hallucinated Lines)

A_O1 · May 5, 2026, 5:29pm

Experiencing the same problem here.

We hit the same finish-reason and truncation symptoms running gemini-3.1-flash-tts-preview for non-conversational audio tour generation (single speaker, no multi-speaker config). I spent the last week or so quantifying it across endpoints and am sharing the data in case it’s useful for the team triaging this.

Setup: 10 real production tour scripts, 1421–1705 chars each, voice = Charon. Single-input synthesis, not multi-speaker. Same SDK config across runs (timeout=90, retry=None).

Results across the four endpoints that accept this model:

endpoint: aiplatform.googleapis.com/.../streamGenerateContent (SSE)
success rate: 80% full audio, 20% silently short
finish reasons on success: mix of STOP and OTHER
notes: OP’s “OTHER on truncation” reproduces 100% of the time here
────────────────────────────────────────
endpoint: texttospeech.googleapis.com/.../streamingSynthesize (Cloud TTS bidi)
success rate: 3/10 to 5/10 across re-runs
finish reasons on success: STOP when it works
notes: the rest fail with 400 InvalidArgument: violates Vertex AI’s usage guidelines, support code 54702341 = Unspecified
────────────────────────────────────────
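For anyone who wants to reproduce the "silently short" numbers, here is a rough sketch of the kind of check that can flag a truncated response without listening to it. It is not our production harness. It assumes the audio comes back as base64-encoded 16-bit mono PCM at 24 kHz, and the speaking rate and threshold (`CHARS_PER_SECOND`, `MIN_RATIO`) are hypothetical values you would need to calibrate for your own voice and scripts. The function names are made up for illustration.

```python
import base64
from collections import Counter

# Assumptions (not confirmed in this thread): raw PCM, 24 kHz, 16-bit, mono.
SAMPLE_RATE_HZ = 24_000
BYTES_PER_SAMPLE = 2

# Hypothetical calibration values; tune them against known-good outputs.
CHARS_PER_SECOND = 15.0   # rough speaking rate for the chosen voice
MIN_RATIO = 0.7           # below this fraction of expected length => "short"


def pcm_seconds(b64_audio: str) -> float:
    """Duration in seconds of a base64-encoded PCM payload."""
    raw = base64.b64decode(b64_audio)
    return len(raw) / (SAMPLE_RATE_HZ * BYTES_PER_SAMPLE)


def classify(script: str, b64_audio: str, finish_reason: str) -> str:
    """Label one response as 'full/<reason>' or 'short/<reason>'."""
    expected = len(script) / CHARS_PER_SECOND
    actual = pcm_seconds(b64_audio)
    status = "full" if actual >= MIN_RATIO * expected else "short"
    return f"{status}/{finish_reason}"


def tally(results):
    """Count outcomes over (script, b64_audio, finish_reason) tuples."""
    return Counter(classify(s, a, r) for s, a, r in results)
```

Feeding each endpoint's responses through `tally` gives a per-endpoint breakdown of full versus short audio alongside the reported finish reason, which is how a split like "STOP vs OTHER" on truncated output can be counted rather than eyeballed.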

Is there any way to reach the team or get updates on whether this is being worked on, and whether we should wait for a fix?
