Issue with Gemini 1.5 Pro EXP API: Getting Different Results Compared to AI Studio Playground

Gfsrdd_Sweeh · October 25, 2024, 8:25am

I’m using the Gemini 1.5 Pro EXP model API with the image shown below, but I’m encountering the same issue. Here’s the error I’m receiving:

Response:

python
GenerateContentResponse(
    done=True,
    iterator=None,
    result=protos.GenerateContentResponse({
      "candidates": [
        {
          "finish_reason": "RECITATION",
          "index": 0
        }
      ],
      "usage_metadata": {
        "prompt_token_count": 407,
        "total_token_count": 407
      }
    }),
)

Here’s the code I’m using:

import google.generativeai as genai
import os
import PIL.Image
from dotenv import load_dotenv
from google.generativeai.types import HarmCategory, HarmBlockThreshold

load_dotenv()

# Configure the API key
genai.configure(api_key=os.environ["GEMINI_API_KEY4"])

# Define generation configuration
generation_config = {
    "temperature": 0,
    "top_p": 0.95,
    "top_k": 64,
    "max_output_tokens": 8192,
    "response_mime_type": "text/plain",
}

# Create a GenerativeModel instance
model = genai.GenerativeModel(
    model_name="gemini-1.5-pro-exp-0827",
    generation_config=generation_config,
    safety_settings={
        HarmCategory.HARM_CATEGORY_HATE_SPEECH: HarmBlockThreshold.BLOCK_NONE,
        HarmCategory.HARM_CATEGORY_HARASSMENT: HarmBlockThreshold.BLOCK_NONE,
        HarmCategory.HARM_CATEGORY_SEXUALLY_EXPLICIT: HarmBlockThreshold.BLOCK_NONE,
        HarmCategory.HARM_CATEGORY_DANGEROUS_CONTENT: HarmBlockThreshold.BLOCK_NONE,
    },
    system_instruction="""
    Extract questions and their corresponding options (if applicable) from an image of a question paper, including any directions, instructions, and question numbers. The extracted content should be formatted in LaTeX code, preserving the original structure and layout of the question paper. 

    - **Questions**: Format each question with its number.
    - **Options**: Use the enumerate environment for multiple-choice options, labeling each option as (a), (b), (c), etc.
    - **Diagrams**: If a question includes a diagram, use the TikZ package to recreate the diagram in LaTeX. The TikZ code should be appropriately placed within the LaTeX document, ensuring the diagram aligns with the relevant question.
    """
)

# Path to the images folder
images_folder = "images"
image_files = [f for f in os.listdir(images_folder) if f.endswith('.png')]

# Process each image file
for image_file in image_files:
    image_path = os.path.join(images_folder, image_file)
    sample_image = PIL.Image.open(image_path)

    # Generate content based on the image
    response = model.generate_content([sample_image])

    # Print the generated LaTeX code or handle the response as needed
    print(response)

When I use the same image and system instruction in the Gemini AI Studio Playground, I receive the correct response. You can see the following images:

1.The first image shows the API response.

The second image shows the Gemini AI Studio Playground response.

Screenshot 2024-09-03 1234261657×933 114 KB
The third image shows the safety settings.

Screenshot 2024-09-03 1237001188×811 94.9 KB
The fourth image is the sample image to process.

question1 (12)1920×1080 250 KB

Topic		Replies	Views
How to do batch Inference on Prompt Image pairs with Gemini API without getting errors Gemini API gemini-15 , bug , api	1	298	May 28, 2024
Significant Difference in Response Quality between Google AI Studio and Gemini 2.5 Pro API (gemini-2.5-pro-03-25) Gemini API feedback , api , gemini-25 , gemini-2-5	7	403	June 4, 2025
Flash 2-0 doesn't respect BLOCK_NONE on ALL harm categories Gemini API bug , api , safety	7	1174	May 8, 2025
Safety settings don't seem to work with search? Gemini API bug , api	1	47	May 16, 2025
Cannot use system instruction with stream mode of `gemini-1.5-flash-002` Gemini API gemini-15 , bug , api	7	372	January 10, 2025

Issue with Gemini 1.5 Pro EXP API: Getting Different Results Compared to AI Studio Playground

Related topics