Unexpected InvalidArgument error for large response_schema

seidtgeist · August 26, 2024, 11:36am

I’ve encountered an issue with the Gemini API where there seems to be an undocumented size limit for the response_schema parameter in GenerationConfig. When attempting to use a schema with a large number of properties or long property names, the API throws an InvalidArgument error. What’s worse, there’s no error description.

This behavior isn’t mentioned in the documentation, and it’s unclear what the exact limits are. Has anyone else experienced this? Are there any official guidelines on the maximum size or complexity of the response_schema?

I’ve attached a minimal reproducible example demonstrating the issue.

I encourage you to try different parameter values. For example:

With these parameters the request succeeds consistently:
- num_properties = 42
- property_name_length = 34
However, these consistently cause an InvalidArgument exception:
- num_properties = 43
- property_name_length = 34
And so do these:
- num_properties = 42
- property_name_length = 35

This leads me to believe that the combined length of all property names could be the issue.
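As a quick sanity check on that hypothesis, the total number of property-name characters for the three parameter sets above can be computed directly. This is only arithmetic on the numbers already listed; it is an assumption that the API measures anything like this total, and the variable names are just for illustration.

```python
# (num_properties, property_name_length) -> outcome observed in the runs above
cases = {
    (42, 34): "succeeds",
    (43, 34): "InvalidArgument",
    (42, 35): "InvalidArgument",
}

# Total characters across all property names for each parameter set
totals = {params: params[0] * params[1] for params in cases}

for params, outcome in cases.items():
    print(params, totals[params], outcome)
```

The passing case totals 1428 characters while both failing cases total 1462 or more, which is consistent with (but does not prove) a limit on the combined length rather than on either parameter alone.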

I’m looking forward to someone looking behind the GAPIC veil

import asyncio
import copy
from pprint import pprint
import random
import string
from vertexai.generative_models import GenerationConfig, GenerativeModel
from google.api_core.exceptions import InvalidArgument


prompt = "Respond according to the JSON schema."
# Vary these two values to find where the request starts failing.
num_properties = 40
property_name_length = 40

# Random lowercase property names, all of the same length.
properties = [
    "".join(random.choices(string.ascii_lowercase, k=property_name_length))
    for _ in range(num_properties)
]

# Flat object schema with one string property per generated name.
json_schema = {
    "type": "object",
    "properties": {name: {"type": "string"} for name in properties},
}

model = GenerativeModel("gemini-1.5-pro-001")


async def main():
    try:
        response = await model.generate_content_async(
            contents=prompt,
            generation_config=GenerationConfig(
                temperature=0.0,
                response_mime_type="application/json",
                response_schema=copy.deepcopy(json_schema),
            ),
            stream=True,
        )
    except InvalidArgument as e:
        print("Request failed as expected with InvalidArgument error:")
        print(e)
        print(
            f"generation_config.response_schema had {num_properties} properties, {property_name_length} characters each:"
        )
        pprint(json_schema)


# In a notebook, which already runs an event loop, use `await main()` instead.
asyncio.run(main())
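To narrow down the threshold without trying values by hand, a binary search over one parameter is one option. The sketch below is hypothetical: `find_largest_passing` and `fake_passes` are names made up for illustration, the search assumes that once a value fails every larger value fails too, and the 1428-character limit in the stand-in predicate is only chosen to mimic the observations above, not a documented limit. In practice the predicate would send the real request and return False on InvalidArgument.

```python
def find_largest_passing(passes, low, high):
    """Return the largest n in [low, high] for which passes(n) is true.

    Assumes passes(low) is true and that failures are monotonic:
    if n fails, every value larger than n fails as well.
    """
    while low < high:
        mid = (low + high + 1) // 2  # round up so the loop always makes progress
        if passes(mid):
            low = mid
        else:
            high = mid - 1
    return low


# Stand-in for a real API call (assumption): pretend requests fail once the
# combined property-name length exceeds assumed_limit characters.
def fake_passes(num_properties, property_name_length=34, assumed_limit=1428):
    return num_properties * property_name_length <= assumed_limit


print(find_largest_passing(fake_passes, 1, 100))
```

With the stand-in predicate this needs about seven probes for the range 1 to 100, which matters when every probe is a billed API request.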

seungduk · September 18, 2024, 1:59pm

Encountering the same issue here.

github.com/google/generative-ai-go

Response schema seems having a length limit

opened 06:34PM - 14 Sep 24 UTC

seungduk-yanolja

type:bug type:api-issue

### Description of the bug:

I have been debugging a weird issue for the past few nights and finally found the tipping point between inputs that cause errors and those that do not. I get an error if the schema is too long, and no error if the schema is shortened a little.

### Actual vs expected behavior:

First, please note that I have converted the OpenAI schema into a Gemini one. This version gives me an error,

```
{
  "temperature": 0,
  "model": "gemini-1.5-flash",
  "response_format": {
    "type": "json_schema",
    "json_schema": {
      "schema": {
        "properties": {
          "accuracy_of_message_delivery": { "$ref": "#/$defs/AppropriatenessOfLanguageUse" },
          "appropriateness_of_language_use": { "$ref": "#/$defs/AppropriatenessOfLanguageUse" }
        },
        "$defs": {
          "AppropriatenessOfLanguageUse": {
            "properties": {
              "verb_tenses_correct": { "$ref": "#/$defs/Score" },
              "conjunctions_and_transitions_effective": { "$ref": "#/$defs/Score" },
              "punctuation_appropriate": { "$ref": "#/$defs/Score" },
              "pronouns_used_correctly": { "$ref": "#/$defs/Score" },
              "articles_and_prepositions_correct": { "$ref": "#/$defs/Score" },
              "capitalization_rules_followed": { "$ref": "#/$defs/Score" },
              "idiomatic_expressions_natural": { "$ref": "#/$defs/Score" },
              "vocabulary_choice_appropriate": { "$ref": "#/$defs/Score" },
              "sentence_structures_appropriate": { "$ref": "#/$defs/Score" },
              "subject_verb_agreement_maintained": { "$ref": "#/$defs/Score" },
              "active_passive_voice_appropriate": { "$ref": "#/$defs/Score" },
              "grammar_rules_followed": { "$ref": "#/$defs/Score" },
              "technical_terminology_consistent": { "$ref": "#/$defs/Score" },
              "spelling_correct": { "$ref": "#/$defs/Score" }
            },
            "type": "object"
          },
          "Score": {
            "type": "object",
            "title": "Score",
            "properties": {
              "comment": { "type": "string", "title": "Comment" },
              "score": { "type": "integer", "title": "Score" }
            }
          }
        },
        "type": "object"
      }
    }
  },
  "messages": [
    { "content": "Evaluate the texts in a JSON format.", "role": "user" }
  ]
}
```

but this one does not.

```
{
  "temperature": 0,
  "model": "studio/gemini-1.5-flash",
  "response_format": {
    "type": "json_schema",
    "json_schema": {
      "schema": {
        "properties": {
          "accuracy_of_message_delivery": { "$ref": "#/$defs/AppropriatenessOfLanguageUse" },
          "appropriateness_of_language_use": { "$ref": "#/$defs/AppropriatenessOfLanguageUse" }
        },
        "$defs": {
          "AppropriatenessOfLanguageUse": {
            "properties": {
              "verb_tenses_correct": { "$ref": "#/$defs/Score" },
              "conjunctions_and_transitions_effective": { "$ref": "#/$defs/Score" },
              "punctuation_appropriate": { "$ref": "#/$defs/Score" },
              "pronouns_used_correctly": { "$ref": "#/$defs/Score" },
              "articles_and_prepositions_correct": { "$ref": "#/$defs/Score" },
              "capitalization_rules_followed": { "$ref": "#/$defs/Score" },
              "idiomatic_expressions_natural": { "$ref": "#/$defs/Score" },
              "vocabulary_choice_appropriate": { "$ref": "#/$defs/Score" },
              "sentence_structures_appropriate": { "$ref": "#/$defs/Score" },
              "subject_verb_agreement_maintained": { "$ref": "#/$defs/Score" },
              "active_passive_voice_appropriate": { "$ref": "#/$defs/Score" },
              "grammar_rules_followed": { "$ref": "#/$defs/Score" },
              "technical_terminology_consistent": { "$ref": "#/$defs/Score" }
            },
            "type": "object"
          },
          "Score": {
            "type": "object",
            "title": "Score",
            "properties": {
              "comment": { "type": "string", "title": "Comment" },
              "score": { "type": "integer", "title": "Score" }
            }
          }
        },
        "type": "object"
      }
    }
  },
  "messages": [
    { "content": "Evaluate the texts in a JSON format.", "role": "user" }
  ]
}
```

### Any other information you'd like to share?

The Go lang SDK does not support `$defs` and `$ref` so I converted them to the SDK supported format.
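For readers wanting to reproduce the `$defs`/`$ref` conversion mentioned in the issue, here is a minimal Python sketch of one way to inline local references. The function name `inline_refs` is hypothetical and this is not necessarily how the issue author or any SDK does it; it handles only `#/$defs/...` pointers and would recurse forever on a self-referencing definition. The tiny schema at the bottom is a made-up example, not the one from the issue.

```python
import copy
import json

PREFIX = "#/$defs/"


def inline_refs(schema, defs=None):
    """Return a copy of `schema` with local "#/$defs/..." references expanded."""
    if defs is None:
        # On the top-level call, pick up the definitions from the schema itself.
        defs = schema.get("$defs", {}) if isinstance(schema, dict) else {}
    if isinstance(schema, dict):
        ref = schema.get("$ref")
        if isinstance(ref, str) and ref.startswith(PREFIX):
            # Replace the reference with an expanded copy of its target.
            return inline_refs(copy.deepcopy(defs[ref[len(PREFIX):]]), defs)
        # Drop the "$defs" block itself and recurse into everything else.
        return {k: inline_refs(v, defs) for k, v in schema.items() if k != "$defs"}
    if isinstance(schema, list):
        return [inline_refs(v, defs) for v in schema]
    return schema


# Made-up example schema with one shared definition used twice.
example = {
    "type": "object",
    "properties": {
        "a": {"$ref": "#/$defs/Score"},
        "b": {"$ref": "#/$defs/Score"},
    },
    "$defs": {"Score": {"type": "integer"}},
}

flat = inline_refs(example)
print(json.dumps(flat))
print(len(json.dumps(example)), len(json.dumps(flat)))
```

Note that inlining repeats every shared definition at each place it is used, so a schema that is compact with `$ref` can grow considerably once converted, which may matter if the limit depends on the size of the schema that is actually sent.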

Evan_Carothers · September 19, 2024, 3:56pm

Happening to me as well! Here are more duplicates:

https://www.googlecloudcommunity.com/gc/AI-ML/Unexpected-400-errors-with-Generated-Output-Schema/td-p/807575

Evan_Carothers · September 19, 2024, 4:00pm

Interestingly, I’ve noticed that I can pass a substantially larger response schema to the API on Vertex AI, but with AI Studio it fails with a much smaller schema.

seidtgeist · September 26, 2024, 6:27pm

I noticed your reply in the other thread and re-ran my example snippet above. Unfortunately it’s still returning the same exception.

Would you mind sharing an example that’s working for you, or trying to run the example above in your account?
