When performing an image-to-image (i2i) or image editing task with gemini-2.5-flash-image, the operation succeeds when the source image is provided as Base64-encoded `inlineData`. However, the operation fails silently (returns a text response without an image) when the same source image is provided via the Files API using a `fileData` part. Image _analysis_ tasks work correctly with the Files API.
This suggests that the internal image generation/editing module cannot resolve `fileUri` references from the Files API and requires the raw pixel data to be sent directly in the request.
Hi @h2.d2,

Welcome to the Google AI Forum!

Thank you for bringing this to our attention.

Could you please share the full payload details along with a sample of the code you are using? We would like to reproduce the issue.
Steps to Reproduce

Prerequisites:

- A valid Google AI API key.
- An image file (e.g., `my-image.png`) ready for upload.
- A script or tool to make requests to the `/v1beta/models/gemini-2.5-flash-image:generateContent` endpoint.
Scenario 1: Image Editing with inlineData (Works as Expected)

- Prepare the Request Body: Create a JSON payload where the source image is Base64-encoded and placed in an `inlineData` part. The image part should come before the text prompt in the `parts` array.

  ```json
  {
    "contents": [
      {
        "role": "user",
        "parts": [
          {
            "inlineData": {
              "mimeType": "image/png",
              "data": "[BASE64_ENCODED_IMAGE_STRING]"
            }
          },
          { "text": "Edit this image. Make the hair red." }
        ]
      }
    ]
  }
  ```

- Send the Request: Send a POST request with this body to the `generateContent` endpoint.
- Observe the Result: The API returns a response containing both a text part (e.g., "Here is the image with red hair") and an `inlineData` part with the newly generated image.

Expected Behavior: An edited image is successfully generated and returned.
Actual Behavior: This works correctly.
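The working Scenario 1 request can be sketched in Python as a small payload builder. This is a minimal sketch: the function name, the placeholder PNG bytes, and the prompt are illustrative, and the actual POST to the endpoint is left as a comment.

```python
import base64
import json

def build_inline_data_request(image_bytes: bytes, prompt: str) -> dict:
    """Build a generateContent body with the image embedded as Base64 inlineData."""
    return {
        "contents": [{
            "role": "user",
            "parts": [
                # Image part first, followed by the text prompt, as in Scenario 1.
                {"inlineData": {
                    "mimeType": "image/png",
                    "data": base64.b64encode(image_bytes).decode("ascii"),
                }},
                {"text": prompt},
            ],
        }]
    }

# Placeholder bytes standing in for the contents of my-image.png.
payload = build_inline_data_request(b"\x89PNG\r\n", "Edit this image. Make the hair red.")
body = json.dumps(payload)
# POST `body` to /v1beta/models/gemini-2.5-flash-image:generateContent
```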
Scenario 2: Image Editing with fileData (Fails)

- Upload the Image: First, upload the source image (`my-image.png`) to the Files API to obtain a `fileUri` (e.g., `files/xyz123`).
- Prepare the Request Body: Create a JSON payload that references the uploaded image via a `fileData` part. The image part should come before the text prompt.

  ```json
  {
    "contents": [
      {
        "role": "user",
        "parts": [
          {
            "fileData": {
              "mimeType": "image/png",
              "fileUri": "files/xyz123"
            }
          },
          { "text": "Edit this image. Make the hair red." }
        ]
      }
    ]
  }
  ```

- Send the Request: Send a POST request with this body to the `generateContent` endpoint.
- Observe the Result: The API returns a response containing only a text part (e.g., "Of course, here is the image with red hair."), but no `inlineData` part with the actual image is included. The image generation fails silently.

Expected Behavior: An edited image should be generated and returned, just like in Scenario 1.
Actual Behavior: The model acknowledges the request in text but fails to produce an image.
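The failing Scenario 2 request differs only in the image part, which can be sketched the same way. This is an illustrative builder, not a definitive client; the `fileUri` value is the example one from the steps above.

```python
import json

def build_file_data_request(file_uri: str, prompt: str) -> dict:
    """Build a generateContent body referencing a Files API upload by URI."""
    return {
        "contents": [{
            "role": "user",
            "parts": [
                # Same structure as Scenario 1, but fileData instead of inlineData.
                {"fileData": {"mimeType": "image/png", "fileUri": file_uri}},
                {"text": prompt},
            ],
        }]
    }

payload = build_file_data_request("files/xyz123", "Edit this image. Make the hair red.")
body = json.dumps(payload)
# POSTing `body` to the same generateContent endpoint reproduces the silent failure.
```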
Hypothesis
The `generateContent` endpoint successfully routes analysis tasks to a module that can access the Files API. However, for generation/editing tasks, it appears to route the request to a different internal module that does not have access to the Files API and requires the image data to be provided directly via `inlineData`.

This behavior is not explicitly documented, leading to the reasonable assumption that `fileData` should be a valid input for all multimodal tasks. A clarification in the official documentation, or a fix allowing the generation module to resolve `fileUri` references, would be highly beneficial.
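In the meantime, callers can detect the silent failure by checking whether the response actually contains an image part. A minimal sketch, assuming the standard `candidates`/`content`/`parts` shape of a `generateContent` JSON response; the sample responses below mirror the two scenarios above:

```python
def response_has_image(response_json: dict) -> bool:
    """Return True if any candidate part carries inline image data."""
    for candidate in response_json.get("candidates", []):
        for part in candidate.get("content", {}).get("parts", []):
            if "inlineData" in part:
                return True
    return False

# Text-only response, as observed in Scenario 2 (the silent failure):
text_only = {"candidates": [{"content": {"parts": [
    {"text": "Of course, here is the image with red hair."},
]}}]}

# Response with both text and an image part, as in Scenario 1:
with_image = {"candidates": [{"content": {"parts": [
    {"text": "Here is the image with red hair"},
    {"inlineData": {"mimeType": "image/png", "data": "[BASE64_IMAGE]"}},
]}}]}
```

A caller could raise or retry with `inlineData` when `response_has_image(...)` returns `False`.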