Thanks for sharing @Oscar_Hoffmann.
So far I have encountered several versions of this issue:
- it doesn't return any text, or returns an empty response
- it returns a couple of words and then ------ until the token limit is reached; this is worse because no error is detected
- it directly returns -------
- but mostly it returns an empty response, sometimes after a couple of minutes of waiting.
We can't use either 2.5 Flash or 2.5 Pro in production.
They both seemed very promising, but not reliable at all.
I've been visiting here for months hoping it would get resolved, but I'm still disappointed; I thought things would be different after I/O 2025. A very market-immature LLM.
response:
GenerateContentResponse(
    done=True,
    iterator=None,
    result=protos.GenerateContentResponse({
      "candidates": [
        {
          "content": {
            "role": "model"
          },
          "finish_reason": "STOP",
          "index": 0
        }
      ],
      "usage_metadata": {
        "prompt_token_count": 518,
        "total_token_count": 6628
      },
      "model_version": "models/gemini-2.5-flash-preview-05-20"
    }),
)
Same issue: empty response when using 2.5 Flash. It makes the model very hard to rely on.
Getting the same issue here. One thing to add: this problem became dominant (roughly 2 out of 3 calls) after I added MCP tool support to my framework.
When I switched back to the stable model instead of the preview one, the problem seemed to be solved.
@GUNAND_MAYANGLAMBAM Any updates on this? The seemingly random yet frequent occurrence of this issue is hard to work around in production. Currently we are seeing it in about 1 out of 4 calls, exclusively with search-grounded calls.
Added: Switching to the Gemini 2.5 stable models has not resolved the situation for us; empty responses still occur at roughly the same frequency.
Yes, the problem doesn't seem to exist (or is suppressed) in the stable models, but they underperform too much compared to the latest preview models, both Flash and Pro.
The problem still exists for us with the stable models. I've been using Flash a lot, and the problem seems worse there than with Pro (which still has the issue).
The problem seems to have gotten worse.
Did you implement the retry logic suggested by @Bryan_Hughes? It is a workable stopgap for now. We increased the retry limit to 10 (!!!) and this seems to always produce a valid output at some point. The problem for our use case is that IF Gemini 2.5 Pro generates output, it is by far superior AND the cheapest compared to all other LLMs out there. This thread is getting pretty long and persistent. I expect more involvement and communication from Google.
Yes, I did, but it still sometimes returns an empty response, and sometimes even outputs garbage such as -------- or a few words followed by ----------. These cases make it difficult to even catch the error. I am using it to transcribe handwritten texts and then turn those texts into structured outputs.
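Because these failures come back with HTTP 200 and finish_reason STOP, they have to be caught by validating the text itself before retrying. A rough heuristic sketch that flags the failure modes reported in this thread (empty output, or long runs of dashes); the dash-run threshold is an assumption and should be tuned for your own transcripts:

```python
import re

def looks_like_garbage(text: str, max_dash_run: int = 10) -> bool:
    """Flag the failure modes reported in this thread: empty output,
    or output containing long runs of dashes."""
    if not text.strip():
        return True
    # a run of max_dash_run or more consecutive dashes counts as garbage
    return re.search(r"-{%d,}" % max_dash_run, text) is not None

assert looks_like_garbage("")
assert looks_like_garbage("a few words ----------------")
assert not looks_like_garbage("A normal transcription of the page.")
```

Wiring this check into the retry loop lets the dash-garbage case trigger a retry just like the plain empty responses do.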
I am still facing this issue with the gemini-2.5-flash model. Is there any solution for this?
Ran into the same issue. After some prompt engineering, I managed to get results out of Gemini.
Still facing this issue on a regular basis. For additional information:
- only happens with grounded search (Python google.genai library)
- when it fails for a specific prompt, it typically fails repeatedly for that prompt
- the same exact prompt (with same model, settings and sys message) will work without issues in AI Studio
- despite no response text being returned, we are still seeing token charges in the billing console
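To substantiate the billing point, it helps to log the usage metadata on every call, including the failed ones; even when no text comes back, the token counts are populated (as in the payloads quoted in this thread). A minimal sketch over the parsed response dict, handling both the camelCase and snake_case key spellings seen in this thread:

```python
def log_usage(response: dict) -> dict:
    """Extract the token counts that will show up on the bill,
    whether or not any text came back."""
    usage = response.get("usageMetadata") or response.get("usage_metadata") or {}
    prompt = usage.get("promptTokenCount", usage.get("prompt_token_count", 0))
    total = usage.get("totalTokenCount", usage.get("total_token_count", 0))
    # everything beyond the prompt is billable output and/or thought tokens
    return {"prompt_tokens": prompt, "output_and_thought_tokens": total - prompt}

# The empty STOP response earlier still reports 518 prompt / 6628 total tokens:
usage = log_usage({"usage_metadata": {"prompt_token_count": 518,
                                      "total_token_count": 6628}})
assert usage == {"prompt_tokens": 518, "output_and_thought_tokens": 6110}
```

Persisting these per-call numbers alongside the empty-response flag is one way to tie charges to failed prompts even though the billing console does not break costs down per request.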
I can confirm that we are getting same random issue in Node.js using @google/genai.
Same issue hitting 2.5-flash with searchTools using Ruby:
HTTP Status Code: 200
HTTP Status Message: OK
Response Headers:
content-type: application/json; charset=UTF-8
vary: Origin, X-Origin, Referer
server: scaffolding on HTTPServer2
x-xss-protection: 0
x-frame-options: SAMEORIGIN
x-content-type-options: nosniff
connection: close
transfer-encoding: chunked
Response Body Encoding (before force_encoding): ASCII-8BIT
Response Body (raw, potentially with encoding issue): nil
GeminiService error: "\xC3" from ASCII-8BIT to UTF-8
Hello everyone,
Sincere apologies for the inconvenience you are all experiencing. Please be advised that resolving the ongoing errors is being given top priority. The infrastructure is at an early stage of development and we are doing everything we can to make your experience less stressful. Thank you for your continued comments; they help us identify the areas requiring an immediate response.
Thank you
Thanks for the update. Do you have any response to the fact that many of us who get empty responses back still seem to be charged? Unfortunately, because Google Cloud does not bill on a per-prompt basis, it is hard to isolate, other than that it feels like I am getting charged. We are in production and cannot just experiment.
Cheers,
Bryan
For me, the model suddenly began returning empty text with finishReason set to MAX_TOKENS.
Switching to the lite model temporarily resolved the issue, but it started failing again the next day.
{
  "Response Body": {
    "candidates": [
      {
        "content": {
          "parts": [
            {
              "text": ""
            }
          ],
          "role": "model"
        },
        "finishReason": "MAX_TOKENS",
        "index": 0
      }
    ],
    "usageMetadata": {
      "promptTokenCount": 316,
      "totalTokenCount": 715,
      "promptTokensDetails": [
        {
          "modality": "TEXT",
          "tokenCount": 316
        }
      ],
      "thoughtsTokenCount": 399
    },
    "modelVersion": "gemini-2.5-flash"
  }
}
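The arithmetic in that payload is telling: 316 prompt tokens plus 399 thought tokens equals the 715 total, so the entire maxOutputTokens budget was consumed by internal "thinking" before any visible text was produced, which is why the text is empty with finishReason MAX_TOKENS. A quick diagnostic sketch for this case, over the parsed dict (the function name is hypothetical):

```python
def thinking_ate_budget(response: dict) -> bool:
    """True when MAX_TOKENS was hit and the output budget went entirely
    to thought tokens, leaving no visible text (the case shown above)."""
    candidate = response["candidates"][0]
    usage = response["usageMetadata"]
    parts = candidate.get("content", {}).get("parts", [])
    text = "".join(p.get("text", "") for p in parts)
    hit_cap = candidate.get("finishReason") == "MAX_TOKENS"
    # visible output tokens = total - prompt - thoughts
    visible = (usage["totalTokenCount"] - usage["promptTokenCount"]
               - usage.get("thoughtsTokenCount", 0))
    return hit_cap and not text.strip() and visible <= 0
```

When this returns True, raising maxOutputTokens (or capping the thinking budget, where the API exposes that) is a more targeted fix than blind retries.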
It turns out Cursor had renamed "generationConfig" to "config" and I had missed this update.
The issue was very confusing, as the LLM call seemed to work initially despite being misconfigured.
const response = await this.gemini.models.generateContent({
    model: this.model,
    contents: fullPrompt,
    // generationConfig: {
    config: {
        temperature: temperature,
        maxOutputTokens: maxTokens
    }
});
I've been using the same prompt setup with 2.5 Pro for several weeks. A few days ago I started getting this same problem of a response with no content but finish_reason STOP. I have a retry loop already set up, and it works, but yeah, it's getting worse, and are we being charged? I suspect this is traffic related, since simply retrying works. I should probably add exponential backoff to space out the attempts, to avoid racking up as many (pricey) failures…
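For reference, the retry-with-backoff idea can be sketched in Python like this; `call_gemini` stands in for whatever wrapper you use around the API, and the default attempt count mirrors the limit of 10 mentioned earlier in the thread:

```python
import random
import time

def generate_with_retry(call_gemini, max_attempts: int = 10,
                        base_delay: float = 1.0) -> str:
    """Retry empty responses with exponential backoff plus jitter.

    `call_gemini` is any zero-argument callable returning the response text,
    or None/"" on the empty-response failures described in this thread.
    """
    for attempt in range(max_attempts):
        text = call_gemini()
        if text and text.strip():
            return text
        # back off: base, 2*base, 4*base, ... plus jitter to spread retries
        time.sleep(base_delay * (2 ** attempt) + random.uniform(0, 0.5))
    raise RuntimeError(f"Empty response after {max_attempts} attempts")
```

The jitter keeps many clients from retrying in lockstep; if the failures really are traffic related, spacing attempts out this way should also reduce the number of billed-but-empty calls compared to immediate retries.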