Hi - I am exploring Gemini 1.5 Pro’s capabilities. I have noticed that it sometimes returns incomplete (invalid) JSON output. I have tried instructing it to drop any elements in the response that are deemed unsafe, but as far as I can tell it ignores instructions of this type.
Any clues?
Welcome to the forum.
Are you supplying the JSON schema using the method described here: Generate JSON output with the Gemini API | Google AI for Developers, or are you supplying it in the prompt? The first method is preferred for Gemini 1.5 Pro.
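For reference, a minimal sketch of that preferred method with the Python SDK (google-generativeai); the schema and prompt here are placeholders for illustration, not taken from your setup:

import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")  # placeholder

# Illustrative schema; replace with the one you actually need.
schema = {
    "type": "object",
    "properties": {
        "capabilities": {
            "type": "array",
            "items": {"type": "string"},
        },
    },
    "required": ["capabilities"],
}

model = genai.GenerativeModel(
    "gemini-1.5-pro",
    generation_config={
        "response_mime_type": "application/json",
        "response_schema": schema,
    },
)

response = model.generate_content("Your prompt here")
print(response.text)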
Hope that helps!
Hi - I am using the preferred method (i.e. not in the prompt).
Maybe if you post an example of the malformed JSON the model gave you, we might get some insight. Scrubbed of any sensitive information, obviously. The model should follow the instruction when it is provided in generation_config.
I am using (for now) AI Studio. Here is what I get for the prompt “RFID chip”:
{"capabilities": ["wirelessly transmit data", "track objects", "identify objects",
I have an idea. The safety filters might be acting up. That would explain the output abruptly stopping. Try it again, but move the safety settings to block few or block none.
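If you want to set that from code rather than the AI Studio UI, a rough sketch with the Python SDK (which categories you relax, and how far, is up to you):

import google.generativeai as genai
from google.generativeai.types import HarmCategory, HarmBlockThreshold

# BLOCK_ONLY_HIGH roughly corresponds to "Block few"; BLOCK_NONE to "Block none".
model = genai.GenerativeModel(
    "gemini-1.5-pro",
    safety_settings={
        HarmCategory.HARM_CATEGORY_HARASSMENT: HarmBlockThreshold.BLOCK_ONLY_HIGH,
        HarmCategory.HARM_CATEGORY_DANGEROUS_CONTENT: HarmBlockThreshold.BLOCK_ONLY_HIGH,
    },
)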
The prompt produced decent JSON when I tried it:
{
"capabilities": [
{
"category": "Identification",
"description": "Provides unique identification for objects or individuals.",
"examples": [
"Inventory tracking",
"Access control",
"Passport identification"
]
},
{
"category": "Data storage",
"description": "Stores a limited amount of data related to the tagged item.",
"examples": [
"Product information",
"Medical records",
"Asset maintenance history"
]
},
{
"category": "Tracking and location",
"description": "Enables real-time or periodic tracking of tagged items.",
"examples": [
"Supply chain management",
"Asset tracking",
"Patient monitoring"
]
},
{
"category": "Authentication and security",
"description": "Verifies the authenticity of tagged items and prevents counterfeiting.",
"examples": [
"Product authentication",
"Document verification",
"Secure access control"
]
},
{
"category": "Wireless communication",
"description": "Communicates wirelessly with RFID readers to exchange data.",
"examples": [
"Inventory updates",
"Data logging",
"Payment processing"
]
},
{
"category": "Sensor integration",
"description": "Can be integrated with sensors to collect and transmit environmental data.",
"examples": [
"Temperature monitoring",
"Humidity tracking",
"Motion detection"
]
}
]
}
Yes, I have played a bit with the safety settings, but I am not willing to compromise on those. I consider the model returning invalid JSON to be a bug: if it is not capable of filtering the unsafe content, it should at least return valid JSON, IMO.
If we agree that the root cause of the incomplete JSON output is that the model sometimes triggers its safety settings (which can be verified by observing the little triangle near the top of the model response), I would suggest (a) changing the topic heading to read “…when safety is triggered” instead of “…sometimes”, and (b) adding the bug tag.
This is a situation where the model is caught between two competing and contradictory requirements:
- Traverse the provided schema and fill in leaf nodes with content per user prompt; repeat 5-7 times
- Do not show content that violates one of the safety settings
It obviously prioritizes the second requirement, which results in content generation ending abruptly.
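If you are calling the API directly rather than using AI Studio, you can check for this by inspecting the response metadata; a small sketch, with field names from the google-generativeai Python SDK and a model object assumed to be configured as in the earlier sketch:

response = model.generate_content("RFID chip")

candidate = response.candidates[0]
# A finish_reason of SAFETY indicates generation was cut short by the safety
# filters, which is when the JSON comes back truncated.
print("finish_reason:", candidate.finish_reason)
print("safety_ratings:", candidate.safety_ratings)
# prompt_feedback covers the case where the prompt itself was blocked outright.
print("prompt_feedback:", response.prompt_feedback)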
When describing buggy behavior, it is a good idea for users to also state what they expect the model to do instead. For example, continue generating and substitute content that does not trigger the safety settings, until the generated output is syntactically correct. Or, trim the generated content to the last syntactically correct output that excludes the content that triggered safety. These are just ideas; you should specify what you want the model’s behavior to be.
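As a rough illustration of the second idea (trimming to the last syntactically correct output), here is a client-side sketch; it is a workaround on the caller’s side, not the behavior the model itself should have:

import json

def salvage_json(text):
    """Best-effort repair of a truncated JSON string: find the longest prefix
    that parses once any open brackets are closed. A workaround sketch only."""
    closers = {"{": "}", "[": "]"}
    for end in range(len(text), 0, -1):
        prefix = text[:end].rstrip().rstrip(",")
        stack, in_string, escaped = [], False, False
        for ch in prefix:
            if escaped:
                escaped = False
            elif ch == "\\":
                escaped = True
            elif ch == '"':
                in_string = not in_string
            elif not in_string and ch in closers:
                stack.append(closers[ch])
            elif not in_string and ch in ("}", "]"):
                if stack:
                    stack.pop()
        if in_string:
            continue  # prefix ends inside a string literal; keep trimming
        try:
            return json.loads(prefix + "".join(reversed(stack)))
        except json.JSONDecodeError:
            continue
    return None

# With the truncated output posted earlier in the thread:
truncated = '{"capabilities": ["wirelessly transmit data", "track objects", "identify objects",'
print(salvage_json(truncated))
# {'capabilities': ['wirelessly transmit data', 'track objects', 'identify objects']}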
Hope that helps!
After testing many other ways of instructing the model to remove unsafe content but continue generating safe content, I have found none that work. Is there a way to file a bug report on this?
In AI Studio, click the three dots in the upper right corner, then Send feedback.
Attempting to convince the model to not generate content that may trigger safety settings is futile. We all tried and failed.
Many thanks for this insight! I still believe we should at least be able to expect well-formed responses.
I’m using the API, not AI Studio, and have tried so many things. The problem I get is JSON with unescaped characters, so the JSON structure breaks. I’ve tried instructing Gemini not to use quotes and to escape characters, but it isn’t consistent and just breaks often. Basically, my responses break about 10% of the time due to this.
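For what it is worth, a common client-side guard for that failure mode is to validate and re-request; a rough sketch, assuming a model already configured with response_mime_type="application/json":

import json

def generate_json_with_retry(model, prompt, attempts=3):
    """Re-request when the response text does not parse as JSON."""
    last_error = None
    for _ in range(attempts):
        response = model.generate_content(prompt)
        try:
            return json.loads(response.text)
        except (json.JSONDecodeError, ValueError) as exc:
            last_error = exc
    raise ValueError(f"No valid JSON after {attempts} attempts") from last_error

It does not fix the root cause, but it caps the impact of the roughly 10% failure rate.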
Although probably not related, the Python code I get from the “Get Code” function in AI Studio contains malformed statements involving misuse of quotes.
E.g. required = "["capabilities", "input", "entity"]",
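Presumably the intended statement is a plain list without the extra surrounding quotes, something along the lines of:

required = ["capabilities", "input", "entity"],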
I’ve also been struggling with this.
I found that passing an example response helps to some extent. I also provided the shape (schema) of the JSON.
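A rough sketch of that approach (the schema shape and the example response here are made up for illustration):

import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")  # placeholder

prompt = """Return information about the entity as JSON matching this shape:
{"capabilities": [{"category": "...", "description": "...", "examples": ["..."]}]}

Example response for "barcode":
{"capabilities": [{"category": "Identification",
                   "description": "Encodes a product identifier.",
                   "examples": ["Retail checkout", "Inventory counts"]}]}

Entity: RFID chip
"""

model = genai.GenerativeModel("gemini-1.5-pro")
response = model.generate_content(prompt)
print(response.text)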