Welcome to the forum!
The problem, I believe, is this specification: you cannot push `max_output_tokens` past the model's own limit, which you can read from `list_models()`. For Gemini 1.5 (both Pro and Flash) that limit is 8192 output tokens. The one-million-token context window applies to *input* tokens only.
That effectively forces you to split your input into chunks (probably two, in your case) and to issue enough `generate_content()` requests to work through the whole dataset.
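A minimal sketch of that chunk-and-loop approach, assuming the `google-generativeai` Python SDK (the `chunk()` helper, the model name, and the sample records are hypothetical; the actual API calls are shown commented out since they need an API key):

```python
def chunk(items, n_chunks):
    """Split `items` into `n_chunks` roughly equal consecutive parts."""
    size = -(-len(items) // n_chunks)  # ceiling division
    return [items[i:i + size] for i in range(0, len(items), size)]

# Hypothetical dataset; replace with your own records.
records = [f"record {i}" for i in range(10)]
parts = chunk(records, 2)  # two chunks of five records each

# Each chunk then gets its own request, so each response
# stays within the 8192-token output cap:
#
# import google.generativeai as genai
# model = genai.GenerativeModel("gemini-1.5-flash")
# for part in parts:
#     response = model.generate_content("\n".join(part))
#     print(response.text)
```

The key point is that the 8192-token cap applies per response, so splitting the work across several `generate_content()` calls is the only way to get more total output.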