Multi-Image Input Support for Veo 3.1 API Video Generation

Khizar_Alam · February 6, 2026, 3:27pm

Hello,

I’m currently experimenting with Veo 3.1 for video generation.

Using the Gemini API (google-genai), I’ve confirmed that video generation works with a single reference image and a prompt. However, my goal is to input multiple images (e.g., first and last frames or a small set of keyframes) to create a continuous video sequence, and this does not appear to be supported by the Python SDK.

I’d like to clarify the following points:

Is multi-image input officially supported when accessing Veo 3.1 via the Gemini API, specifically to create a video sequence (e.g., start/end frames or keyframes)?
If not, is this a limitation of the model itself or of the current SDK?
Does the Vertex AI API expose additional Veo 3.1 capabilities—such as first/last frame interpolation or multi-image input for generating a video sequence—that are not currently available through the Gemini API or Python SDK?

Any clarification on the official support status and recommended integration patterns would be greatly appreciated.

Thank you for your time and support.

Best regards,
Khizar

Topic		Replies	Views
Is there a way to generate videos with multiple images using the Veo3 API? Gemini API prompt , veo	2	700	October 4, 2025
Create video from image - Gemini Veo API Gemini API api , veo	1	100	November 21, 2025
How does VEO3.1 achieve the "Using first and last frames" function through the API? Gemini API help_request , veo	2	144	December 29, 2025
Veo 3.1 public API availability & pricing (60s, 1080p, multi-prompt transitions) Gemini API gemini , veo	2	1605	October 15, 2025
How to correctly structure the 'video' object for Veo 3.1 endpoint? Gemini API ai-studio , api , models , generative-ai , veo	6	810	October 31, 2025

Multi-Image Input Support for Veo 3.1 API Video Generation

Related topics