[Critical Bug] Image Vision Retrieval Always Returns First Generated Image

Michael_Bowerman · December 31, 2025, 3:40am

Severity: P0 - Critical (Core feature completely broken)

Product: Gemini 3 (Free Tier)

Summary:
When Gemini generates multiple images in a conversation and attempts to view them using its vision capabilities, it consistently returns the first generated image regardless of which image is requested. This makes it impossible for Gemini to accurately describe or analyze any images it generates after the first one.

Reproduction Steps:

1. Start a new conversation with Gemini
2. Ask Gemini to generate an image (e.g., “Generate a red square”)
3. Ask Gemini to generate a second, different image (e.g., “Generate a blue circle”)
4. Ask Gemini to look at and then describe the second image it just generated
5. Observe that Gemini describes the first image (red square) instead of the second (blue circle)

Expected Behavior:
Gemini should be able to view and accurately describe each image it generates, referencing the correct image data for each position in the conversation history.

Actual Behavior:

Single-image vision tool: Returns the first generated image 100% of the time when viewing subsequent Gemini-generated images in standalone outputs (no accompanying text)
Multi-image vision tool: Inconsistently returns either correct images or n copies of the first image (no clear pattern identified)
Exception: Single-image tool works correctly for user-uploaded images
Exception: If Gemini generates both text and an image in the same response, it can see that specific image correctly

Impact:

Gemini cannot reliably describe, analyze, or reference its own generated images
Users receive hallucinated descriptions of images
Multi-turn image generation workflows are completely broken
This undermines trust in Gemini’s multimodal capabilities

Technical Analysis:
The bug appears to be in the image retrieval backend, likely in how images are indexed/cached in conversation history:

# Hypothesized current (broken) behavior:
def get_conversation_image_broken(conversation, image_index):
    return conversation.images[0]  # Ignores image_index; always returns the first image

# Expected behavior:
def get_conversation_image(conversation, image_index):
    return conversation.images[image_index]  # Returns the requested image
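
To make the hypothesis above concrete, here is a minimal, self-contained sketch of a check that would catch this kind of indexing bug. Everything in it is an assumption for illustration: the `Conversation` class, both getter functions, and `mismatched_indices` are hypothetical stand-ins, not the actual backend code, which is not visible from the outside.

```python
from dataclasses import dataclass, field


@dataclass
class Conversation:
    """Hypothetical stand-in for the per-conversation image store."""
    images: list[str] = field(default_factory=list)


def get_image_suspected(conversation: Conversation, image_index: int) -> str:
    # Models the suspected bug: the requested index is ignored.
    return conversation.images[0]


def get_image_expected(conversation: Conversation, image_index: int) -> str:
    # Models the expected behavior: the requested index is honored.
    return conversation.images[image_index]


def mismatched_indices(conversation: Conversation, getter) -> list[int]:
    """Return every index for which getter returns the wrong image."""
    return [
        i
        for i, image in enumerate(conversation.images)
        if getter(conversation, i) != image
    ]


conversation = Conversation(images=["red square", "blue circle", "green triangle"])
print(mismatched_indices(conversation, get_image_suspected))  # [1, 2]
print(mismatched_indices(conversation, get_image_expected))   # []
```

In this toy model, every image after the first is reported as mismatched, which matches what I observe with the single-image tool.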

Workarounds:

None reliable for single-image retrieval
Multi-image retrieval sometimes works but is inconsistent
User-uploaded images can be viewed correctly
Generating an image with text in the same response may work (but triggers a separate JSON leakage bug ~75% of the time)

Additional Context:
When using multi-image retrieval, Gemini’s internal reasoning shows it correctly receives metadata indicating that image_generation_content/0 is being returned. This suggests the bug is in the backend API that retrieves images, not in Gemini’s tool-calling logic.
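
The metadata observation suggests a simple consistency check: compare the index that was requested against the index embedded in the returned reference. The sketch below assumes that the trailing number in a reference such as `image_generation_content/0` is a zero-based index into the generated images. That reading is a guess on my part, and `referenced_index` and `is_wrong_image` are hypothetical helper names, not part of any Gemini API.

```python
def referenced_index(image_ref: str) -> int:
    """Extract the trailing integer from a reference like 'image_generation_content/0'."""
    # Assumption: the part after the last '/' is a zero-based image index.
    _, _, suffix = image_ref.rpartition("/")
    return int(suffix)


def is_wrong_image(requested_index: int, image_ref: str) -> bool:
    """True when the returned reference does not point at the requested image."""
    return referenced_index(image_ref) != requested_index


# Requesting the second image (index 1) but receiving a reference to index 0.
print(is_wrong_image(1, "image_generation_content/0"))  # True
print(is_wrong_image(0, "image_generation_content/0"))  # False
```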

Reproducible: Yes, 100% reproducible for single-image tool on Gemini-generated images after the first one

Test Conversation Links:

https://gemini.google.com/share/2ce932fac5ea
https://gemini.google.com/share/e63f34e5566c (Gemini mentions “image_0” at one point in its reasoning here, although unfortunately its reasoning does not seem to be visible in the shared conversation)
https://gemini.google.com/share/1301acb1c8bd
I have reproduced this consistently in many other conversations; the ones above are just examples.

Srikanta_K_N · December 31, 2025, 9:05am

Hi @Michael_Bowerman,

Thank you for bringing this to our attention. We truly appreciate you flagging this issue, and we will file a bug internally.
