[Major Bug] Image Generation Prompt Mismatch in Conversation History

Michael_Bowerman · December 31, 2025, 3:47am

Severity: P1 - High (Causes systematic hallucinations)

Product: Gemini 3 (Free Tier)

Summary:
When Gemini generates an image from an open-ended prompt, it appears to create multiple “draft” prompts internally. The prompt that gets stored in conversation history often differs from the prompt actually used to generate the image. This causes Gemini to hallucinate incorrect descriptions when asked about the image it generated.

Reproduction Steps:

Start a new conversation with Gemini
Ask Gemini to “generate an image on any topic of your choosing” (open-ended prompt)
Gemini generates an image (e.g., a goose)
Ask Gemini to describe the image it just generated based on the prompt that it had sent to the image generation tool
Observe that Gemini describes something completely different (e.g., bioluminescent mushrooms)

Expected Behavior:
The image generation prompt stored in conversation history should match the prompt actually sent to the image generator. Gemini should be able to accurately describe the image by referencing the correct prompt.

Actual Behavior:

Gemini generates multiple candidate prompts internally
Prompt A (e.g., “bioluminescent mushrooms”) gets stored in conversation history
Prompt B (e.g., “upland goose”) is sent to image generator
User sees image B (goose)
Gemini reads back Prompt A from history and describes mushrooms
Complete mismatch between actual image and Gemini’s description

Impact:

Gemini cannot reliably describe images it generates from open-ended prompts
High rate of hallucinated image descriptions
Users cannot trust Gemini’s analysis of its own generated content
Particularly problematic for creative workflows where users give Gemini artistic freedom

Frequency:
Very high probability when prompts are open-ended or give Gemini creative choice. Lower probability with highly specific prompts.

Technical Analysis:
Appears to be a race condition or state synchronization issue where:

Multiple draft prompts are generated
One draft is selected and sent to image generator
A different draft gets persisted to conversation history
Gemini reads back the wrong draft when attempting to describe the image

Workarounds:

Use highly specific, detailed prompts (reduces but doesn’t eliminate the issue)
In theory, you could ask Gemini to use its vision tool rather than relying on text memory. However, this tool has a separate bug which I’ve reported, making it completely unreliable as well
Consequently, the only way for Gemini to know what image it generated is for the user to download the image and then re-upload it into the Gemini chat.

Reproducible: Yes, high probability with open-ended prompts

Test Conversation Links:

https://gemini.google.com/share/6c823dd626a7 (this also demonstrates the bug with the vision tool that I mentioned earlier)
https://gemini.google.com/share/06e72d8fbb99
Several others; these are just the examples I had on hand, but this is very easily reproducible.

Topic		Replies	Views
[Critical Bug] Image Vision Retrieval Always Returns First Generated Image Gemini API bug , image-generation	1	84	December 31, 2025
Critical bug: Vertex API with context cache leaks prompt state between generateContent calls Gemini API bug , api , gemini-api , vertexai	2	164	December 29, 2025
New prompt problem (unsaved prompts) new prompt but last prompt's answer Google AI Studio gemini-15 , ai-studio , prompting , bug	7	499	February 11, 2025
[Moderate Bug] JSON Leakage When Generating Text + Image in Same Response Gemini API bug , image-generation	12	689	March 6, 2026
Mixed Image + Text output with Nano Banana Pro Gemini API models , gemini , feature-request	1	266	December 16, 2025

[Major Bug] Image Generation Prompt Mismatch in Conversation History

Related topics