JSON Schema causes issues with Gemini Pro/Flash

Stefan_Streichsbier · November 14, 2024, 12:15am

I have confirmed an issue when enabling a JSON schema, which results in the underlying model Pro/Flash producing repeating tokens until the max output tokens are reached.

I’m using the Python SDK (0.8.3) and a simple schema:

class SingleMatchVerificationResult(TypedDict):
    conclusion: bool
    confidence: int 
    reason: str
    confidence_reason: str

that eventually is passed directly to:

response = client.generate_content(
            prompt,
            generation_config=genai.types.GenerationConfig(
                candidate_count=1,
                max_output_tokens=512,
                temperature=0,
                response_mime_type="application/json" if json_mode else "text/plain",
                response_schema=json_schema if json_mode else None
            )
        )

Here is a sample response that shows it repeating itself.

I can share plenty of prompts and provide code that will help reproduce this.

Let me know what is needed.

Vishal · November 14, 2024, 5:40am

Thanks for flagging this! Would you be able to share a sample prompt so I can repro it on my end?

Stefan_Streichsbier · November 15, 2024, 1:23am

Hi Vishal, sure thing, I’ve created a gist for this here:

Prompt to reproduce JSON schema mode issues in the Gemini API · GitHub

Vishal · November 17, 2024, 11:59pm

Thank you! I’ll take a look and file it with Eng.

Vishal · November 18, 2024, 10:29pm

Surprisingly, I get a proper response when I set json_mode to False, so this is definitely a JSON mode issue. Filed this with Eng!

Stefan_Streichsbier · December 10, 2024, 11:02am

Thanks, Vishal, I just tested it again today and noticed that the issue is still there.
Is there any update from Eng?

Joel_borc · December 10, 2024, 7:13pm

It’s still happening to me, too. It’s so persistent I had to switch models on my side.

Stefan_Streichsbier · December 11, 2024, 5:22am

Yeah, it’s a real issue.
Without schema validation, models like Flash and Flash 8b regularly respond with broken JSON, even with enabled JSON mode.
With schema validation, it completely breaks for all models.

We can’t run production workloads like this.

Stefan_Streichsbier · December 12, 2024, 4:13am

Update: I’ve just switched to the new google-genai python SDK and everything worked immediately.

This is sufficient for me and I can consider this issue closed.
The fix is to use the new SDK.

Topic		Replies	Views
Gemini Flash Model Ignoring JSON Schema in Prompts Gemini API gemini-15 , api , models , gemini-api	2	252	November 21, 2024
API error occur due to recent Gemini Update (after submission) Gemini API Developer Competition api	2	420	September 4, 2024
Gemini-1.5-flash does not use defined schema Gemini API gemini-15 , bug	1	155	June 14, 2024
Gemini Python SDK, response schema, data cut off, with max_output_tokens Gemini API python	1	188	October 16, 2024
Bug Report the model often starts creating repetitive sequences of tokens Gemini API gemini-15	12	845	April 11, 2025

JSON Schema causes issues with Gemini Pro/Flash

Related topics