Mixed Image + Text output with Nano Banana Pro

SebastianWitt · December 16, 2025, 8:57am

With Nano Banana it was possible to generate an image and simultaneously output a text alongside it.

A prompt like:
”Return an image of a space shuttle launch then describe what you would expect to see in the image. Return an image and the text description.”

Would return the image and a text. With Nano Banana Pro this is no longer possible. I am guessing this is due to the thought of chain?

Maybe there is a trick but I was not able to get both image and text from a single api call. Obviously i could send the image back to gemini and request a description but i was hoping to skip this additional step.

If it is intended that output modalities “image + text“ should be able to return both in the same prompt consider this a bug report. If it is not currently within the scope of the models capabilities this is more of a feature request.

-Thanks for the awesome model

SebastianWitt · December 16, 2025, 9:05am

I just found this thread. This is a duplicate of https://discuss.ai.google.dev/t/text-component-missing-with-gemini-3-pro-image-preview/110635

Topic		Replies	Views
Text component missing with gemini-3-pro-image-preview Gemini API bug , api , image-generation	3	284	January 5, 2026
Using Gemini Nano Banana AI to generate an image, but it tells me it can’t generate images? Gemini API image-generation	6	1416	January 30, 2026
Imagen 4.0 API Issue: Long Contextual Prompts Rendered as Text Instead of Creative Guidance - Multimodal Alternative Needed? Gemini API api , image-generation	1	197	June 6, 2025
Image generation API hits sometimes return text only Gemini API api , gemini-3	0	37	April 20, 2026
Complex prompt (possible cause) leading to no image with Nano Banana Pro API Gemini API vertexai	5	332	January 8, 2026

Mixed Image + Text output with Nano Banana Pro

Related topics