Metallic sounds using gemini-2.5-flash-preview-tts

Chris_Dool · December 11, 2025, 12:08pm

Since today (11-12-2025) I have a different voice (not really a problem) but also an annoying metallic pitch in the generated WAV. This was not a problem before today.

Today, 12-12-2025, about 8 minutes into a generated .WAV file, I thought I heard a fire engine outside. The sound was in the WAV, not outside.

Is this related to the change announced below, and can I solve it myself, or is it already being resolved?

effective December 10, 2025.

Starting on this date, you will automatically get significant improvements in expressivity, pacing, and overall audio quality. To ensure a seamless transition, the new models maintain many characteristics of the previous version.

What you need to do

No action is required from you.

No code changes required: Your existing API calls to gemini-2.5-flash-preview-tts and gemini-2.5-pro-preview-tts will automatically begin using the new model.

Kind regards,

Chris

Chuck · December 12, 2025, 6:43am

Seeing the same issue here. The voice timbre has changed overall as well.

Chris_Dool · December 12, 2025, 8:45am

Is the metallic sound also showing up mostly after a couple of minutes, rather than right away?

Adam_S · December 12, 2025, 5:12pm

Same here. When generating several minutes of speech, after a few minutes it sounds more metallic / degraded. ALSO, the PACE increases as the speech progresses. I gave very specific performance notes instructing that the pace should be even throughout the entire script, but it does not work. The metallic sound and uneven pace need to be addressed.

Chris_Dool · December 12, 2025, 9:43pm

Same here regarding the increasing speed.

Hannes_Bertran · December 13, 2025, 1:14am

I’ve also noticed those issues.

RubenS · January 2, 2026, 2:24pm

I am facing the same issues too.

Pooja_Kapse · January 2, 2026, 3:09pm

Hi @Chris_Dool,
Thank you for bringing this to our attention. We truly appreciate you flagging this issue, and we have escalated it to the relevant team for further investigation.

phil_swenson · January 7, 2026, 2:34pm

@Pooja_Kapse could you give some sort of timeline for a fix? This has been a problem since November now.

Thanks!

tusharf5 · January 10, 2026, 1:16am

The TTS output has been totally unusable since the Dec 10th update.

Chris_Dool · January 10, 2026, 8:19am

I agree, the output is unusable.

Adam_S · January 19, 2026, 2:17pm

Yes, the problems have persisted for months now, as if Google never tests their own product. I generate TTS a couple of times a day, and the results are almost always the same, for any voice used. I typically generate a few minutes of audio per prompt:

The voice quality and pace sound great for the first 25-50% of the result.
After that, the voice quality gets worse, like the voice is coming from a tin can, AND the pace quickens very noticeably.

The fact that this occurs constantly, with no fix after months, really makes it seem as if Google does not use or support this model, or perhaps it is even intentional for some reason?

Andew_Carsten · January 20, 2026, 2:11am

I’ve noticed similar behavior as well, especially on longer generations. The gradual drop in audio quality and change in pacing feels consistent enough to suggest a systemic issue rather than random errors. It would be interesting to hear if others see this too, or if Google has acknowledged it anywhere.
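One rough way to check whether the drift described above is measurable rather than subjective is to compare a simple statistic across successive windows of the same generated file. The sketch below computes the zero-crossing rate per window using only the Python standard library. Zero-crossing rate is only a crude proxy for high-frequency content, so a rising value is a hint, not proof, of the "tin can" effect. It assumes the output was saved as a 16-bit PCM WAV; the function name, the 30-second window, and the file name in the usage note are illustrative assumptions.

```python
import array
import sys
import wave


def zero_crossing_rates(path: str, window_seconds: float = 30.0) -> list[float]:
    """Return the zero-crossing rate (crossings per sample pair) for each window."""
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM WAV files")
        channels = wav.getnchannels()
        rate = wav.getframerate()
        frames = wav.readframes(wav.getnframes())

    samples = array.array("h")
    samples.frombytes(frames)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV data is little-endian

    mono = samples[::channels]  # keep only the first channel
    window = max(1, int(rate * window_seconds))

    rates: list[float] = []
    for start in range(0, len(mono), window):
        block = mono[start:start + window]
        if len(block) < 2:
            continue
        # Count sign changes between neighbouring samples.
        crossings = sum(
            1 for a, b in zip(block, block[1:]) if (a < 0) != (b < 0)
        )
        rates.append(crossings / (len(block) - 1))
    return rates
```

Calling `zero_crossing_rates("output.wav")` on a long generation returns one value per 30-second window. If the later values are consistently higher than the early ones across several files, that would be a concrete data point to attach to a bug report.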

Adam_S · January 27, 2026, 4:04pm

Here is a prime example of the two main problems I see fairly consistently with longer TTS generations: eventual quality degradation, and a faster pace as the speech progresses.

nicholsss · January 30, 2026, 8:20am

Same issue here! The voice gets quite bad after 2-3 minutes, and if I try to chunk audio into smaller pieces, the voice totally changes for the next chunk.
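For anyone who wants to try the chunking approach mentioned above, here is a minimal sketch of the text-splitting side only. It breaks a script at sentence boundaries so that each request stays short. The function name and the 1500-character limit are assumptions for illustration, not values from the Gemini documentation, and this does nothing about the voice changing between chunks.

```python
import re


def split_script(text: str, max_chars: int = 1500) -> list[str]:
    """Split text into chunks of at most max_chars, breaking at sentence ends.

    A single sentence longer than max_chars is kept whole as its own chunk.
    """
    # Split after '.', '!' or '?' when followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars or not current:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each returned chunk would then be sent as a separate TTS request and the resulting audio concatenated afterwards. Splitting at sentence ends at least keeps the joins at natural pauses.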

Adam_S · January 30, 2026, 11:31pm

Day after day, it’s the same results. No word from Google, no changes in the model behavior. At this point it is no longer worth commenting on, and I will just assume this is an abandoned model.

nicholsss · February 2, 2026, 5:05pm

Has anyone been able to resolve this, or is it entirely a model-related issue? Are there any solutions other than changing providers?

Dean_Dodds · May 13, 2026, 7:02pm

I’m also seeing exactly the same issue with 2.5 TTS. I don’t want to move to the newer TTS model as it’s 2x the cost. After 4 minutes, the metallic voice and quickened pace are truly awful.
