Any nice way to output more than 8k tokens of structured json?

The 2M context window in the latest Gemini models is great, but the 8K output token limit is far smaller than competing models offer. For plain text you can simply ask the model to continue, but for validated JSON outputs it's not as straightforward.


Hi,

Welcome to the forum.

Do you have a good use case or example to share? Otherwise, it might be better to open an issue here: https://issuetracker.google.com

Cheers.

I'm running into the same issue. I want to use Gemini 2.0 Flash to transcribe call recordings from my call center, classify them, add feedback/context, and return structured JSON in a desired format. Currently, for a 50-minute conversation the 8K output limit is exhausted around minute 15. In Google AI Studio I can tell it to "Continue" and it will keep writing the JSON, but that's not very feasible programmatically, since I would have to concatenate the fragments and validate the result. Not sure if there is a better way to achieve this.
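A minimal sketch of that continue-and-concatenate loop, under two assumptions: the model resumes exactly where it was cut off (something you'd have to enforce in the prompt), and `send` is a placeholder for whatever chat call you use (e.g. a wrapper around your session's send-message method). The names `stitch_json` and `generate_full_json` are hypothetical helpers, not part of any SDK:

```python
import json


def stitch_json(chunks):
    """Concatenate partial JSON text chunks and parse the result.

    Raises json.JSONDecodeError (a ValueError) if the concatenation
    is still incomplete, which signals we need another continuation.
    """
    return json.loads("".join(chunks))


def generate_full_json(send, first_prompt, max_continues=10):
    """Keep asking the model to continue until the JSON parses.

    `send` is any callable taking a prompt string and returning the
    model's raw text for that turn, within the same chat session so
    the model retains its own partial output as context.
    """
    chunks = [send(first_prompt)]
    for _ in range(max_continues):
        try:
            return stitch_json(chunks)
        except ValueError:
            chunks.append(send(
                "Continue the JSON exactly where you stopped, "
                "with no repetition and no markdown fences."
            ))
    raise RuntimeError("JSON still invalid after max continuations")
```

This still depends on the model resuming cleanly; in practice you may also need to strip markdown fences or overlapping text from each chunk before stitching.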