Persistent Noise in TTS Audio Generation

Aizen_Sosuke · September 15, 2025, 5:39pm

I’m experiencing a persistent issue with the TTS (Text-to-Speech) functionality of the Gemini API. My TTS generation, which previously worked perfectly, now consistently produces a low-level hissing or static noise that is present in all generated audio.

This noise is particularly noticeable when the voice is speaking and is almost completely absent during silent pauses. This suggests the issue is an artifact of the voice generation itself, rather than simple background noise.

I’ve been working with this API for a while, and the problem appeared suddenly around September 12-13, 2025.

Jhonny_Chambers · September 19, 2025, 4:03am

I had the same problem on September 18th.
Were you able to fix it?

Aizen_Sosuke · September 19, 2025, 8:05pm

well idk how to fix it cuz i can’t write code but i’m using auphonic to fix it however it takes more time

Piotr_Jarecki · September 22, 2025, 1:37pm

I have this hissing for quite a while - I posted on X and tried tagging - I found out that the longer the generation is lasting the more the hissing gets in there. Seems like a model issue as it persists on all voices.
Found no way to solve it…

Rafael_Silva · September 23, 2025, 8:26pm

Same thing here. This issue is really annoying. I used to create high quality content but now the problem persists and quality is really poor. I hope they fix it anytime soon.

Pannaga_J · September 25, 2025, 6:57am

Hi guys @Aizen_Sosuke @Piotr_Jarecki @Rafael_Silva @Jhonny_Chambers
Wanted to reproduce from my end . could you please confirm if this is happening with both gemini-2.5-flash-preview-tts and gemini-2.5-pro-preview-tts? Additionally, does this occur with all audio lengths, and if so, please provide the token count at which the hissing sound starts. Any additional details you can provide about the conditions under which this occurs would be very helpful
Thank you

Rafael_Silva · September 25, 2025, 5:39pm

@Pannaga_J yes, both gemini-2.5-flash-preview-tts and gemini-2.5-pro-preview-tts seem to have the same issue. I create short length audios, and the hissing has always been there since Sept 15. Thanks!

Pannaga_J · October 4, 2025, 6:43pm

Thanks for flagging this. We have updated the concerned team about it .

Raff_Silver · October 13, 2025, 6:19pm

Hello guys, any recent updates on this issue?

Has anybody (user / developer) been able to find a solution?

Thanks!

Efectate · December 10, 2025, 2:10am

Same promblem for me. It impossible to use because of noise. Is there anybody solved this?

Raff_Silver · December 16, 2025, 6:48pm

Nothing so far. Unfortunately.

Nasir_Baki · January 2, 2026, 1:43am

The issue is still there. The voice gets robotic with static noises the longer the audio is. The voice loses it’s consistency. Try generating something that is above 1k words.

Gng_Jnck · January 7, 2026, 4:05am

Still the same, maybe that’s why the model is still a preview? Not final yet?

Zenmind1 · January 10, 2026, 1:11pm

2026 and the problem is still there. The hissing, slightly metallic sound can be heard about 60% from the start. Sometimes if you refresh the page and re-create the audio, the subsequent audio does not have the hissing sound. Sometimes it persists and you have to close and open the browser. Even then it words only sometimes.
If you paste a long block of text, say 2300 words (14 - 15 minutes), it gives you only 10:55min of audio, so you have to paste the remaining block of text again to create the remaining 4 to 5 minutes of audio. The hissing can be heard in both clips, the long one and short one.

I’m surprised Google still hasn’t solved this issue.

Adam_S · February 2, 2026, 6:38pm

Yes, a similar conversation here - I’ve given up on the model at this point… : Metallic sounds using gemini-2.5-flash-preview-tts

Adam_S · February 2, 2026, 6:41pm

Duration for the output audio	Approximately 655 seconds. If the input text results in the audio exceeding 655 seconds, the audio is truncated.

S_uP · March 28, 2026, 3:12pm

March 2026 and the issue remains. Even on google’s page when executing the example audio it has such white noise (https://aistudio.google.com/generate-speech)

Topic		Replies	Views
TTS audio generation background noise Google AI Studio gemini-flash , gemini_25_pro	14	1194	May 12, 2026
Gemini TTS API Static Noise Google AI Studio ai-studio , models	0	45	March 31, 2026
Gemini-TTS noticeably worse - cracking, sizzling, scratching background noise Gemini API generative-ai , gemini-flash	2	625	November 11, 2025
[Bug] Severe Audio Quality Degradation (Static/Noise) for Thai TTS Output (AI Studio/API) since Dec 10, 18:00 ICT Google AI Studio feedback , bug , api , vertexai , generative-ai	3	229	December 19, 2025
Metallic sounds using gemini-2.5-flash-preview-tts Gemini API api , gemini-flash	17	592	May 13, 2026

Persistent Noise in TTS Audio Generation

Related topics