I’m working on a search engine that sends the retrieved documents to the Flash model for summarization.
When I retrieve the docs and send them to the model, I get: google.api_core.exceptions.InvalidArgument: 400 The input token count (34108) exceeds the maximum number of tokens allowed (32767).
According to the limits listed online, this should be well within the model’s capabilities (I also tested the Pro model, only to get the same error). Can I define the input token limit myself? Is it reduced by default?
P.S.
The error occurs only when I use grounding (Vertex AI Search). It works fine when grounding is not used.
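For reference, the call looks roughly like this (a minimal sketch using the vertexai SDK; the project, location, model version, and data store path are placeholders, not my real setup):

```python
import vertexai
from vertexai.generative_models import GenerativeModel, Tool, grounding

# Placeholder project and location.
vertexai.init(project="my-project", location="us-central1")

# Ground the model on a Vertex AI Search data store (placeholder resource name).
search_tool = Tool.from_retrieval(
    grounding.Retrieval(
        grounding.VertexAISearch(
            datastore=(
                "projects/my-project/locations/global/collections/"
                "default_collection/dataStores/my-datastore"
            )
        )
    )
)

model = GenerativeModel("gemini-1.5-flash-002")

# This call raises the 400 InvalidArgument error once the combined input
# exceeds ~32k tokens; dropping tools=[search_tool] makes it work.
response = model.generate_content(
    "Summarize the retrieved documents.",
    tools=[search_tool],
)
print(response.text)
```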
Grounding is not supported for non-text input with the 1.5 models. Without the retrieval tool it will work. Use the latest model, “2.0-flash-exp”, with grounding if you want to upload files or other multimodal input.
You can also try this in Vertex AI Studio or Google AI Studio; it will throw the same error.
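A rough sketch of the same call pointed at the newer model (untested here, same vertexai SDK; the data store path is again a placeholder):

```python
from vertexai.generative_models import GenerativeModel, Tool, grounding

# Placeholder Vertex AI Search data store resource name.
DATASTORE = (
    "projects/my-project/locations/global/collections/"
    "default_collection/dataStores/my-datastore"
)

# Same grounding tool as before, just targeting the newer experimental model.
search_tool = Tool.from_retrieval(
    grounding.Retrieval(grounding.VertexAISearch(datastore=DATASTORE))
)

model = GenerativeModel("gemini-2.0-flash-exp")
response = model.generate_content(
    "Summarize the retrieved documents.",
    tools=[search_tool],
)
print(response.text)
```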
If anyone has similar issues with Gemini 2.0 Flash and structured output: I solved it by adding more tokens(!). By adding ~2000 extra tokens to my system prompt, marked as padding so as not to confuse the model, the requests worked again!
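Roughly what that workaround looks like (a sketch only; the padding text, model name, and response schema are illustrative, not the exact prompt used):

```python
from vertexai.generative_models import GenerativeModel, GenerationConfig

# Hypothetical padding block: roughly 2000 extra tokens of filler, clearly
# labelled so the model knows to ignore it.
PADDING = "PADDING, IGNORE THIS SECTION: " + ("lorem ipsum dolor sit amet " * 300)

# Illustrative structured-output schema.
RESPONSE_SCHEMA = {
    "type": "object",
    "properties": {"summary": {"type": "string"}},
    "required": ["summary"],
}

model = GenerativeModel(
    "gemini-2.0-flash",
    system_instruction=["You summarize documents as JSON.", PADDING],
)

response = model.generate_content(
    "Summarize the retrieved documents.",
    generation_config=GenerationConfig(
        response_mime_type="application/json",
        response_schema=RESPONSE_SCHEMA,
    ),
)
print(response.text)
```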