Hey all, I’m working with the Gemini Live API (via WebSockets) and using it to stream LLM output in real time. I understand that the API exposes signals like `generationComplete` and `turnComplete`, which tell me when the model has finished its current output. I can react to those cleanly in my client or backend.
What I need is something a bit different:
Instead of waiting until the model is done, I want a way to detect when the model is getting close to done — so I can call a function, update my UI, prep the next turn, or transition my state before the final completion happens.
Right now my model pipeline looks like this:
- Client opens a `gemini.live.connect` session.
- I stream text/audio and receive chunks back from the model.
- I watch for `server_content.generation_complete` or `server_content.turn_complete` to know the reply is finished.
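For reference, here's a minimal sketch of that receive loop. I'm using plain dicts as stand-ins for the actual server messages (the real SDK delivers typed objects, and the field names here are just taken from my description above), so treat this as pseudocode for the flow, not the real wire format:

```python
def process_stream(chunks, on_text, on_done):
    """Feed streamed chunks to callbacks; fire on_done at completion.

    `chunks` is an iterable of dicts shaped like the server_content
    messages described above -- a stand-in, not the real SDK types.
    """
    for chunk in chunks:
        content = chunk.get("server_content", {})
        text = content.get("text")
        if text:
            on_text(text)
        # generation_complete / turn_complete are the documented
        # end-of-output signals -- the only "done" events I know of.
        if content.get("generation_complete") or content.get("turn_complete"):
            on_done()
            return
```

The point being: the loop only learns the reply is over at the moment the completion flag arrives, with no advance warning.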
But there doesn’t seem to be any built-in “N tokens left” or “almost done” event in the Gemini Live spec that gets emitted before `generationComplete`. The API docs only define the normal completion flags — no progress percentage or remaining-token info.
Before I build a heuristic (like counting streamed tokens/chars and calling my callback when some threshold is met), I wanted to check:
- Has anyone seen undocumented or hidden signals that indicate “approaching end of generation”?
- Are there better client-side heuristics people use with Gemini Live when they need early notice that a reply is ending?
- Or is the community just using `generationComplete` as the de facto only reliable signal?
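In case it helps the discussion, here's roughly what I mean by the counting heuristic. Everything here is my own invention (names, threshold, and the moving-average length estimate are all guesses, not anything the API provides):

```python
class AlmostDoneDetector:
    """Client-side guess at 'approaching end of generation'.

    Fires a callback once the streamed character count crosses a
    fraction of the expected reply length, where the expectation is
    an exponential moving average over previous replies. Pure
    heuristic -- the Live API emits no such signal itself.
    """

    def __init__(self, on_almost_done, threshold=0.8, initial_estimate=400):
        self.on_almost_done = on_almost_done
        self.threshold = threshold        # fraction of expected length
        self.expected = initial_estimate  # running estimate, in chars
        self.seen = 0
        self.fired = False

    def feed(self, text_chunk):
        """Call for every streamed text chunk."""
        self.seen += len(text_chunk)
        if not self.fired and self.seen >= self.threshold * self.expected:
            self.fired = True
            self.on_almost_done()

    def complete(self):
        """Call when generation_complete arrives; update the estimate."""
        # Blend the just-observed reply length into the running average.
        self.expected = 0.7 * self.expected + 0.3 * self.seen
        self.seen = 0
        self.fired = False
```

Obvious weakness: reply lengths vary a lot, so this fires early on long replies and late (or never, before completion) on short ones. That's why I'd rather use a real signal if one exists.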
For context: I’m aware this isn’t about end-of-turn detection or voice activity detection — I’m talking strictly about approaching the end of the model’s text/audio generation while it’s still streaming.
Thanks!