Currently, the Gemini text-to-speech (TTS) interface (gemini-2.5-pro-preview-tts) “only” supports Linear-PCM encapsulated in a WAV container as the output format. The official documentation does not provide configuration parameters to directly encode speech into AAC/M4A.
I really wish it could support AAC/M4A format in the response. Thanks!
“Modified by moderator” It causes me to be very impolite to gemmy sometimes. It really would be nice to drive to work and converse about what’s on your mind, could take the place of lying in bed way too late at night reading Wikipedia articles like a few years ago.
In general though, yeah totally agree and support.