Custom pronunciations not working with Chirp3 HD voices

Niran_Pravithana · September 23, 2025, 3:09pm

I’m trying to use custom pronunciations with Google Cloud TTS following the exact example from the Chirp3 HD documentation:

{

"input": {

  "text": "There is a dog in the boat",

  "custom_pronunciations": {

    "phrase": "dog",

    "phonetic_encoding": "PHONETIC_ENCODING_X_SAMPA",

    "pronunciation": "\\"k{t"

  }

},

"voice": {

  "language_code": "en-US",

  "name": "en-us-Chirp3-HD-Leda"

},

"audio_config": {

  "audio_encoding": "LINEAR16"

}

}

The API call succeeds and returns audio, but the word “dog” is still pronounced as “dog” instead of “cat” (k{t in X-SAMPA).

Questions:

1. Do Chirp3 HD voices actually support custom pronunciations?

2. Is there something wrong with the request format?

3. Are there any undocumented limitations?

The documentation shows this exact example but it doesn’t work in practice. Any help would be appreciated.

Aciax_Hls · September 23, 2025, 5:08pm

{
“input”: {
“text”: “There is a dog in the boat”
},
“custom_pronunciations”: [
{
“phrase”: “dog”,
“phonetic_encoding”: “PHONETIC_ENCODING_X_SAMPA”,
“pronunciation”: “"k{t”
}
],
“voice”: {
“language_code”: “en-US”,
“name”: “en-us-Chirp3-HD-Leda”
},
“audio_config”: {
“audio_encoding”: “LINEAR16”
}
}

Mungkin ini bisa membuatmu paham

Niran_Pravithana · September 23, 2025, 5:33pm

Terima kasih banyak, sekarang saya lebih paham.

Topic		Replies	Views
Transcribe text to text and vice versa, speech to speech and image to text in a flutter app using gemini Gemini API	15	714	May 20, 2024
Is it possible to make the Chirp 3 TTS Voice sound like gemini-2.5-flash-tts-preview? Gemini API prompt , gemini-25 , gemini-flash-2-5	1	283	July 3, 2025
How do I build a custom voice recognition model for multiple people? TF.js tfjs , datasets , help_request	24	7151	September 18, 2021
How does one get access to the API for TTS features of Gemini-2.0? Google AI Studio feature_request	8	1446	December 21, 2024
How to Customize Gemini 2.0 Flash Voice for Hypnotic, Slow-Paced Guided Meditation with Pauses? Gemini API gemini	2	682	December 12, 2024

Custom pronunciations not working with Chirp3 HD voices

Related topics