Very specific issue, but I do not get this error when:
- N=1
- There is no JSON content in the prompt.
Is it just me? Anyone able to reproduce?
```python
import json

from vertexai.preview.generative_models import GenerationConfig, GenerativeModel

# Create a JSON structure large enough to trigger the error (~100K chars)
items = []
for i in range(1600):
    items.append({
        "id": i,
        "name": f"Item {i}",
    })
test_json = json.dumps({"items": items}, indent=4)
print(f"JSON size: {len(test_json):,} characters")

# Initialize model
model = GenerativeModel("gemini-2.0-flash")

# Test: JSON with candidate_count=2 (will fail)
prompt = f"Analyze this JSON data:\n\n{test_json}"

# The key setting that triggers the error is candidate_count > 1
response = model.generate_content(
    prompt,
    generation_config=GenerationConfig(candidate_count=2),
)
```
Hi @Mika_Myrseth
As you’re using Vertex AI, I was curious whether it’s related to the platform. Here’s the same example for Google AI using the newer google-genai package.
```python
import json
import os

from dotenv import load_dotenv
from google import genai
from google.genai import types

load_dotenv()  # take environment variables from .env

client = genai.Client(
    api_key=os.environ["GOOGLE_API_KEY"],
)

# Create a JSON structure large enough to trigger the error (~100K chars)
items = []
for i in range(1600):
    items.append({
        "id": i,
        "name": f"Item {i}",
    })
test_json = json.dumps({"items": items}, indent=4)
print(f"JSON size: {len(test_json):,} characters")

# Test: JSON with candidate_count=2 (will fail)
prompt = f"Analyze this JSON data:\n\n{test_json}"

# The key setting that triggers the error is candidate_count > 1
response = client.models.generate_content(
    model="gemini-2.0-flash",
    contents=prompt,
    config=types.GenerateContentConfig(
        candidate_count=2,
    ),
)
print(response.text)
```
It’s giving me the same HTTP 400 error when candidate_count is > 1:
```
google.genai.errors.ClientError: 400 INVALID_ARGUMENT. {'error': {'code': 400, 'message': 'Request contains an invalid argument.', 'status': 'INVALID_ARGUMENT'}}
```
The token count (prompt ~ 46k) looks OK though.
However, using a different model works.
- gemini-2.0-flash Error
- gemini-2.0-flash-exp Error
- gemini-2.0-flash-thinking-exp-01-21 OK
- gemini-2.0-flash-thinking-exp-1219 OK
- gemini-exp-1206 Timeout and OK
- gemini-2.0-pro-exp-02-05 OK
- gemini-2.0-flash-lite Error
OK results come with the following note:

```
there are 2 candidates, returning text from the first candidate.Access response.candidates directly to get text from other candidates.
```
Seems that it is model-related.
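Until that’s sorted out, one possible workaround (just a sketch, the helper and its names are mine) is to ask for multiple candidates and fall back to a single candidate if the request is rejected:

```python
def generate_with_fallback(call, primary_count=2, fallback_count=1,
                           retryable=(Exception,)):
    """Try call(candidate_count) with the primary count; if the request
    raises a retryable error, retry once with a single candidate."""
    try:
        return call(primary_count)
    except retryable:
        return call(fallback_count)


# Usage against the google-genai client would look roughly like:
# result = generate_with_fallback(
#     lambda n: client.models.generate_content(
#         model="gemini-2.0-flash",
#         contents=prompt,
#         config=types.GenerateContentConfig(candidate_count=n),
#     ),
#     retryable=(genai.errors.ClientError,),
# )
```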
Cheers
Very interesting, thanks for testing!