The new “Native Audio” model is worse at tool calling than the much better gemini-live-2.5-flash-preview model. The old model could complete function/tool calls 90-100% of the time, whereas the new model struggles most of the time and hallucinates, claiming it used the function call when in fact it did not. I see no logical reason to remove the older model (which is not much older) and leave users with a “newer” model that doesn’t even support a text-only output modality, only audio output. It’s just overall worse at the moment, until it’s refined, especially considering how many people’s services depended on the older model because it was more reliable and stable.
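For anyone who wants to reproduce the comparison, here is a minimal sketch of how we exercise tool calling over the Live API using the google-genai Python SDK. The get_weather declaration is a made-up example, exact field names may differ between SDK versions, and the native-audio model only accepts AUDIO as the response modality, which is part of the problem:

```python
import asyncio
from google import genai
from google.genai import types

client = genai.Client()  # assumes GOOGLE_API_KEY is set in the environment

# Hypothetical tool used only to exercise function calling.
get_weather = types.FunctionDeclaration(
    name="get_weather",
    description="Look up the current weather for a city.",
    parameters=types.Schema(
        type=types.Type.OBJECT,
        properties={"city": types.Schema(type=types.Type.STRING)},
        required=["city"],
    ),
)

config = types.LiveConnectConfig(
    response_modalities=["TEXT"],  # text-only output, which the older model supported
    tools=[types.Tool(function_declarations=[get_weather])],
)

async def main() -> None:
    async with client.aio.live.connect(
        model="gemini-live-2.5-flash-preview",  # swap in the native-audio model to compare
        config=config,
    ) as session:
        await session.send_client_content(
            turns=types.Content(
                role="user",
                parts=[types.Part(text="What's the weather in Tokyo right now?")],
            )
        )
        async for message in session.receive():
            # A well-behaved model emits an explicit tool_call message rather than
            # claiming in prose that it already called the function.
            if message.tool_call:
                for fc in message.tool_call.function_calls:
                    print("tool requested:", fc.name, fc.args)
            elif message.text:
                print("text:", message.text)

asyncio.run(main())
```

With the older model the tool_call message shows up almost every time; with the native-audio model we mostly get prose claiming the call already happened.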
We appreciate you taking the time to share your thoughts with us; your feedback is invaluable as we work to continuously improve the Gemini API experience.
Request: Please Reactivate gemini-live-2.5-flash-preview as a GA Model
The deprecation of gemini-live-2.5-flash-preview has significantly impacted our production workflows. This model was unmatched for real-time transcription and translation use cases.
Why gemini-live-2.5-flash-preview was essential:
- Superior speed for live audio processing
- Excellent quality for simple text outputs
- Cost-effective for production workloads
- Ideal for the Live API’s streaming capabilities
Issues with gemini-2.5-flash-native-audio-preview:
- Noticeably slower response times
- Degraded performance for straightforward text transcription
- Higher costs that make it unsuitable for many production applications
Our request: Please consider reactivating gemini-live-2.5-flash-preview alongside the newer model. Many developers in the community are facing the same challenges. Since the infrastructure already exists, offering both options would give developers the flexibility to choose the right tool for their specific use cases without forcing migration to a model that doesn’t meet their requirements.
Hey!
Just wanted to upvote this comment.
I really don’t understand why the support for text output was removed… This may be subjective, but from my point of view, speech-to-speech models still have to mature a bit more before they can be used in production (at least in the context I’m working in). The half-cascaded architecture that Google used to describe in their Live API documentation was, for me, the current sweet spot. I didn’t need to wire up separate models (VAD + ASR + LLM + TTS) because the model would handle the VAD + ASR + LLM. The experience was really smooth, and I could better control the output audio and even use my own cloned voice.
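For context, here is a rough sketch of the half-cascaded flow I mean, using the google-genai Python SDK. synthesize_with_cloned_voice is a hypothetical placeholder for whatever TTS service hosts your cloned voice, and exact SDK names may vary by version:

```python
import asyncio
from google import genai
from google.genai import types

client = genai.Client()  # assumes GOOGLE_API_KEY is set in the environment

def synthesize_with_cloned_voice(text: str) -> bytes:
    """Hypothetical hook: call whichever TTS service hosts your cloned voice."""
    raise NotImplementedError

async def half_cascaded_turn(user_text: str) -> bytes:
    # The Live model handles turn detection, transcription, and the LLM step;
    # text comes back to us, so speech synthesis stays fully under our control.
    config = types.LiveConnectConfig(response_modalities=["TEXT"])
    async with client.aio.live.connect(
        model="gemini-live-2.5-flash-preview",  # the deprecated half-cascaded model
        config=config,
    ) as session:
        await session.send_client_content(
            turns=types.Content(role="user", parts=[types.Part(text=user_text)])
        )
        reply = ""
        async for message in session.receive():
            if message.text:
                reply += message.text
    return synthesize_with_cloned_voice(reply)

# Example: audio = asyncio.run(half_cascaded_turn("Summarize today's agenda."))
```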
I’ll probably switch to OpenAI’s models or even to a service like Ultravox that does just that.
Also, just a couple of months ago, Google stated in their documentation that “It [half-cascaded architecture] offers better performance and reliability in production environments, especially with tool use.” when comparing half-cascaded audio with native audio. I couldn’t find any information about what changed; as far as I can tell, the claim was simply removed from the documentation.
Any updates on bringing back a Half Cascade model?