Comparing OpenAI's Image Generation with Gemini

Yan_Cheng_Cheok · May 21, 2025, 2:17pm

Hello,

I’m curious whether OpenAI’s image generation model is significantly more advanced than Gemini’s, or if I might not be using Gemini correctly. Could you clarify the differences or suggest best practices for using Gemini effectively?

OpenAI
======

    client = OpenAI(api_key=OPEN_AI_KEY)

    prompt = "Turn this image into Ghibli-style animation art"

    model="gpt-image-1"

    result = client.images.edit(
        model=model,
        image=open("input.jpg", "rb"),
        prompt=prompt
    )

    image_base64 = result.data[0].b64_json
    image_bytes = base64.b64decode(image_base64)

    # Save the image to a file
    with open("output.jpg", "wb") as f:
        f.write(image_bytes)



Gemini
======
    client = genai.Client(api_key=API_KEY)

    image = Image.open("input.jpg")

    prompt = "Turn this image into Ghibli-style animation art"

    response = client.models.generate_content(
        model='gemini-2.0-flash-exp-image-generation',
        contents=[prompt, image],
        config=types.GenerateContentConfig(
            response_modalities=['Text', 'Image']
        )
    )

    for part in response.candidates[0].content.parts:
        if part.text:
            print(part.text)
        elif part.inline_data:
            result_image = Image.open(BytesIO(part.inline_data.data))
            result_image.save('output.jpg')
            result_image.show()

Imgur: The magic of the Internet - Open AI output (good)

Imgur: The magic of the Internet - Gemini output (bad)

Akhilesh_Kambhampati · May 21, 2025, 6:48pm

@Yan_Cheng_Cheok,

welcome to the community, Thank you for reaching out.

its not that one model is better or worse than the other. Models perform well on the data they were trained/fine-tuned on.

in this case,I believe the OPENAI model was fine tuned on gibli style animation/art but GEMINI is not fine-tuned for making art. its is intended for overall realistic image generation

if you want specific style you can try open models that are fine-tuned to the specific style you need.

Note: Checkout Huggingface or Civit.ai for these fine tuned models and they will perform better than any api when it comes to that specific style for all else they would underperform.

Topic		Replies	Views
Gemini-2.0-Flash-Preview-Image-Generation quality reduction in recent update Gemini API models , gemini-flash , gemini-20	12	298	May 24, 2025
Gemini 2.0 Flash (Image Generation) Experimental stopped generating images Gemini API prompt , gemini-flash	5	551	April 17, 2025
Save images in Gemini Gemini API gemini , gemini-20 , image-generation	4	102	April 25, 2025
Token Count Differences between google-generativeai and OpenAI API for Gemini in Python Gemini API open-models , ai	2	140	February 25, 2025
Significant Difference in Response Quality between Google AI Studio and Gemini 2.5 Pro API (gemini-2.5-pro-03-25) Gemini API feedback , api , gemini-25 , gemini-2-5	7	427	June 4, 2025

Comparing OpenAI's Image Generation with Gemini

Related topics