Hi everyone,
I’m using Gemini API streaming in a chat UI, but the chunks I get are sometimes pretty large.
Is there any way to make the chunks smaller, or is this controlled by Gemini on the server side?
It would be nice if the API could stream smaller pieces of text for smoother UI updates.
Thanks!
Hi @Leo_Smith
Thank you for the feedback! Currently, the streaming chunk size is managed dynamically on the server side by our model serving infrastructure. This batching is designed to optimize network efficiency and overall throughput.
I have forwarded this request internally as well.
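In the meantime, you can get smoother UI updates by re-chunking on the client side: buffer each server chunk and emit it in smaller slices at your own pace. Here's a minimal sketch, assuming the Python `google-generativeai` SDK (the model name, slice size, and delay below are just illustrative values to tune for your UI):

```python
import time
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")  # placeholder key
model = genai.GenerativeModel("gemini-1.5-flash")  # illustrative model name

def rechunk(stream, piece_size=8, delay=0.02):
    """Yield small fixed-size slices of each streamed chunk.

    piece_size and delay are illustrative; tune them for your UI.
    Assumes text-only chunks.
    """
    for chunk in stream:
        text = chunk.text
        for i in range(0, len(text), piece_size):
            yield text[i:i + piece_size]
            # Pace the updates; drop this if your render loop paces itself.
            time.sleep(delay)

response = model.generate_content("Tell me a short story.", stream=True)
for piece in rechunk(response):
    print(piece, end="", flush=True)  # stand-in for your chat UI update
```

The same idea works with any of the SDKs: the server's chunk boundaries don't have to match your render granularity.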
Got it, thanks for the clarification