Image Understanding and Segmentation Mask Support

im_peterj · May 20, 2026, 9:09pm

Are segmentation masks still supported? It is unclear what the current support is for the generation of image segmentation masks.

According to the docs, “starting with Gemini 2.5, models not only detect items but also segment them and provide their contour masks.”

The docs say that the segmentation masks are given as a base64 png that is a probability map with values between 0 and 255.

From the example in the docs, the following prompt suggests how to instruct gemini to give the segmentation masks:

prompt = """
Give the segmentation masks for the wooden and glass items.
Output a JSON list of segmentation masks where each entry contains the 2D
bounding box in the key "box_2d", the segmentation mask in key "mask", and
the text label in the key "label". Use descriptive labels.
"""

However, the behavior of segmentation masks is inconsistent across model versions.

Gemini 2.5 currently produces segmentations mask results like this:

{"mask": "<start_of_mask><seg_4><seg_20><seg_4><seg_35><seg_65><seg_27><seg_27>"}

While Gemini 3.5 produces the segmentation mask of the item as a polygon of [x,y] coordinates like this:

{"mask": [[325, 411], [327, 471], [332, 523], [397, 534], [403, 492], [408, 426]]}

In both of these cases the segmentation masks are not produced as base64 png.

Has support for segmentation masks been changed? What is the expected behavior?

Topic		Replies	Views
Gemini vision compabilities Gemini API	1	95	May 15, 2024
Image understanding does not seem to work using the Openai compatible API Gemini API issues , openai_compatibility	2	138	June 12, 2025
Gemini 2.5 Grounding Segments Indices Incorrect (with Google Search) Gemini API api , models , python	1	189	May 21, 2025
Image processing: Prompt vs API Gemini API gemini-api , experimental , model , gemini-flash , gemini-20	8	146	March 5, 2025
Bounding Box detection Failing with Gemini 2.0 flash Gemini API api , gemini-flash , gemini-20	1	223	June 12, 2025

Image Understanding and Segmentation Mask Support

Related topics