Hi,
I’m working on a search engine that sends the retrieved documents to the Flash model for summarization.
When I retrieve the docs and send them to the model, I get: google.api_core.exceptions.InvalidArgument: 400 The input token count (34108) exceeds the maximum number of tokens allowed (32767).
According to the limits listed online, this should be well within the model's capabilities (I also tested the Pro model, only to get the same error). Can I configure the input token limit? Is it reduced by default?
P.S.
The error occurs only when I use grounding (Vertex AI Search). It works fine when grounding is not used.