Regarding Google Project ready Voice module

Hi,
I’m seeking for a Google-provided, production-ready voice module for my project, similar in function to OpenAI’s Whisper, but with a specific focus on real-time, speech-to-speech conversation.

Does Google (through Google Cloud, Gemini, or Vertex AI) offer a high-performance API or SDK that I can purchase and integrate?

The ideal service would need to:

Transcribe incoming audio from a user in real-time.

Process that transcription (e.g., send it to an AI for a response).

Synthesize the text response back into natural-sounding speech.

Deliver this synthesized voice back to the user with minimal latency to enable a fluid, real-time conversation.

In my research, I found a model named gemini-2.5-flash-native-audio-preview-09-2025. This seems promising, but I have a few specific questions:

Is this the correct model for my real-time speech-to-speech use case?

What is the status of this model? The “preview” tag suggests it might not be stable or ready for a live, production-level project. Can you confirm if this is project-ready?

If this is the right choice, what is the recommended integration path or SDK for building a full-duplex voice assistant around this model?

Regards,
Mehedi

Hi @Mehedi_Hasan_Shihab
Thanks for reaching out to us!

Yes, Google offers the Gemini Live API which enables low-latency, real-time voice and video interactions with Gemini. It handles continuous streams of audio, video, or text to deliver immediate, human-like spoken responses, creating a natural, real-time conversational experience for users.

The model gemini-2.5-flash-native-audio-preview-09-2025, used with the Gemini Live API, is designed for real-time audio input and output. Please note that being a Preview model, it may be used for production, but will typically have billing enabled, might come with more restrictive rate limits and will be deprecated with at least 2 weeks notice.

For integration, recommended path is using Gemini Live Api via WebSockets for streaming.

Please refer to the official documentation for more details here: Get started with Live API

1 Like

Thank you Sonali.
I already did that. Solved now.