Bounding Box detection Failing with Gemini 2.0 flash

avanicious · May 21, 2025, 12:01pm

I am trying to get bounding boxes for clothes from outfit images. I’ve recreated the exact code given in the documentation.

github.com/google-gemini/cookbook

quickstarts/Spatial_understanding.ipynb

main

{
  "cells": [
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "lb5yiH5h8x3h"
      },
      "source": [
        "##### Copyright 2025 Google LLC."
      ]
    },
    {
      "cell_type": "code",
      "execution_count": 4,
      "metadata": {
        "cellView": "form",
        "id": "906e07f6e562"
      },
      "outputs": [],
      "source": [

This file has been truncated. show original

I successfully recreated the cupcake example with the exact bounding_box_system_instructions and the user prompt. But when I try to make it specific to my use case, by adding fashion context, it starts to give erroneous bounding boxes atleast 40% of the time.

I made minimal changes to the example prompt to get it to perform the same way but it’s not working out. I tried giving specific instructions to detect clothes as well, but didn’t work. Has anyone else faced this? How to prompt engineer here?

“Modified by Moderator”

Sangeetha_Jana · June 12, 2025, 7:13am

Hey @avanicious
Welcome to the community!
We have released new models with improved performance.
Please try out the gemini-2.5 series models for better results.
Feel free to reach out to us if required.
Thank you!

Topic		Replies	Views
Inaccurate Bounding Box for forms Gemini API api	9	558	June 20, 2025
Gemini-flash-2-5 for bounding box detection performs worse when using thinking Gemini API api , gemini-flash-2-5	2	342	June 25, 2025
Bounding Box for pdf using Flash 2.0 Gemini API gemini-flash	3	328	April 4, 2025
Visual grounding capabilities of gemini 3 flash degraded Gemini API models , gemini-flash , gemini-3	3	156	January 12, 2026
Issues with the Accuracy of Object Coordinates Detected by Gemini 1.5 in Images Gemini API gemini-15	6	766	June 10, 2024

Bounding Box detection Failing with Gemini 2.0 flash

Related topics