Gemini 2.5 Pro on Vertex sometimes returns empty string

Yovel_Cohen · August 17, 2025, 9:02am

Sometimes, when I use gemini 2.5 pro, for some chunks of the data (so same prompts, slightly different inputs) it returns an empty response.

Now, this has been a know issue regarding Prohibited content or max tokens reached.
But in my case, the finish_reason is None:

print(answer.candidates[0])
PyDev console: starting.
Candidate(
  content=Content(
    parts=[
      Part(
        text=''
      ),
    ],
    role='model'
  )
)
print(answer.candidates[0].finish_reason,answer.candidates[0].finish_message)
None None

That means that I’m paying for the input tokens completely normal and the request is valid, but Gemini is just lazy/has an API bug and I don’t get my results.
Now, I can try messing with the input data a bit or the temperature/seed and retry, but I’m still paying for something I didn’t get.
right now, for this feature, it happens about every 1/8 requests, at production scale that’s a lot of wasted money.

print(f'Candidate Tokens Count: {answer.usage_metadata.candidates_token_count}', ' -- ', f'\nInput Tokens Count: {answer.usage_metadata.prompt_token_count}')

Candidate Tokens Count: None  --  
Input Tokens Count: 21844

Lalit_Kumar · August 18, 2025, 6:36am

Hello,

Do you notice this issue only with complex tasks, or does it also occur with simple prompts?

Piotr_Jarecki · August 18, 2025, 11:42am

I started having the exact same:

{
“generations”: [
[
{
“generationInfo”: {
“finishReason”: “STOP”,
“index”: 0
},
“message”: {
“id”: [
“langchain_core”,
“messages”,
“AIMessageChunk”
],
“kwargs”: {
“additional_kwargs”: {},
“content”: ,
“id”: “run-48331704-214d-4461-9bff-351efe20be9d”,
“invalid_tool_calls”: ,
“name”: “model”,
“response_metadata”: {
“finishReason”: “STOP”,
“index”: 0
},
“tool_call_chunks”: ,
“tool_calls”: ,
“usage_metadata”: {
“input_tokens”: 36999,
“output_tokens”: 0,
“total_tokens”: 37048
}
},
“lc”: 1,
“type”: “constructor”
},
“text”: “”
}
]
],
“llmOutput”: {
“tokenUsage”: {
“completionTokens”: 0,
“promptTokens”: 36999,
“totalTokens”: 37048
}
}
}

0 output tokens, and just STOP

Lalit_Kumar · August 19, 2025, 6:36am

Hello,

We have raised your issue to the concerned team. Thank you for your patience.

Yovel_Cohen · August 20, 2025, 5:40am

both, mostly with complex, but if Gemini is unable to answer a query, it should say so, not have the API act as is everything’s fine and also bill me.

GV_Zap · August 20, 2025, 6:06am

From Monday, everyday the issue starts at 11.30am-12pm IST, and lasts until late night 9pm. Again it started today.

Yovel_Cohen · August 20, 2025, 10:28am

Thanks, now it’s just happening more and more also on 2.5 lite.
These are production services and supposed to be production ready models.
We can’t have that happening

user2739 · September 14, 2025, 4:12pm

This started happening to me yesterday as well. Previously, every complex task worked 100%, but now almost any task, regardless of complexity, stops immediately. I think we can prompt the model to disable the insta-stop feature.

Vivek_Vijayan · November 6, 2025, 8:12am

Facing the same problem. I was trying to run OpenHands agent using Gemini-2.5-flash through vertexai. Recently, out of the blue, this issue started to appear. Seems like issue is triggered by increasing the size of the system prompt. Anyways, feels like it’s better to switch to another more reliable model.

Topic		Replies	Views
Possible Bug in Gemini 2.5 Pro Behavior empyty response Gemini API api , models , gemini_25_pro	3	393	January 5, 2026
Gemini 2.5 Pro with empty response.text Gemini API gemini-20	238	22477	April 10, 2026
Gemini again started producing empty responses Gemini API gemini	16	2404	September 22, 2025
Gemini 2.5 Pro - Empty Response-Status 200 Gemini API api , gemini_25_pro	21	1393	August 27, 2025
Empty response.text from Gemini 2.5 Pro, despite no safety and max_tokens issues Gemini API api , gemini-2-5	17	2665	November 24, 2025

Gemini 2.5 Pro on Vertex sometimes returns empty string

Related topics