Running Gemini 2.5 Pro with grounded search sometimes returns an empty response.text with a finish_reason of STOP and no other reason given. When I inspect the response dict, it shows evidence of the search with some web metadata, but nothing else.
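In case it helps others hitting this, here is a minimal diagnostic sketch of the call, assuming the google-genai Python SDK; the model ID, the GoogleSearch tool config, and the attribute names (candidates, finish_reason, grounding_metadata) are what I'd expect from recent SDK versions, so verify them against your install:

# Minimal diagnostic around a grounded-search call (google-genai SDK assumed).
from google import genai
from google.genai import types

client = genai.Client()  # reads the API key from the environment

config = types.GenerateContentConfig(
    system_instruction="<your system prompt>",  # placeholder
    tools=[types.Tool(google_search=types.GoogleSearch())],
)

response = client.models.generate_content(
    model="gemini-2.5-pro",          # use whichever 2.5 Pro model ID you are on
    contents="<your user prompt>",   # placeholder
    config=config,
)

# Surface the empty-response case instead of silently reading response.text
if not response.candidates:
    print("No candidates returned:", response)
else:
    cand = response.candidates[0]
    print("finish_reason:", cand.finish_reason)
    print("text:", response.text)
    # Grounding metadata may still be populated even when the text is empty
    print("grounding:", getattr(cand, "grounding_metadata", None))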
I have exactly the same issue. The problem occurs only sometimes, and a single run of a prompt does not cause it, but the same prompt after several runs can, so it does not seem to be prompt-specific.
Unfortunately our system_prompt is very sensitive, as is our user_prompt, which I appreciate isn't helping here. I can say this: while it is sensitive, it is also not doing anything crazy. We have a rubric of questions for which we are using grounded search to return high-quality results.
Also, this seems to be new behavior within the past week or so, during which I am also seeing more 500 responses. The prompts are very reasonable in number of tokens, so perhaps there is more overloading going on?
Not sure if this helps, but the response object even has None for candidates, so I'm not sure what could cause the model to return a response like this. I would expect any other error to come back as a status code (like 500). Also, the system prompt clearly directs the model to state explicitly when there is no answer.
While I would be happy to share the system prompt privately, the challenge is that it doesn't happen all the time with the same user prompt (which is where the question is). It does seem to be happening on the same question (out of a set), though, so I can experiment with extracting that question from the overall application runs and see if I can get it to reproduce in Colab.
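For the reproduction attempt, the simplest thing is probably a loop in Colab that hammers only the suspect question and counts empty results. A rough sketch, reusing the hypothetical client and config from the earlier snippet (SUSPECT_QUESTION is a placeholder for the one question that fails in the app):

# Repro loop: run the single suspect question repeatedly and count empties.
empty, total = 0, 30
for i in range(total):
    resp = client.models.generate_content(
        model="gemini-2.5-pro",
        contents=SUSPECT_QUESTION,  # hypothetical placeholder
        config=config,
    )
    if not resp.candidates or not resp.text:
        empty += 1
        reason = resp.candidates[0].finish_reason if resp.candidates else None
        print(f"run {i}: empty (finish_reason={reason})")
print(f"{empty}/{total} empty responses")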
What is also messed up is that I think I am being charged for each of these non-responses. Unfortunately, I think the billing API only refreshes every 24 hours.
Some debugging notes: when the request fails with the elements in the response all None (like in the screenshot), my code waits about a minute and then retries. Nothing is cached, so it calls client.models.generate_content each time, and each time it gets nothing back.
I decided to check whether it is the prompt, so I set up a Colab exactly like my code and executed it with the same system and user prompt, and it worked perfectly. So I don't think it is the prompt, or if it is, there is something random between Colab and my Python application, because repeating the request in my Python app also produces the same failed response.
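If a fixed one-minute wait never recovers, exponential backoff with jitter at least rules out a short-lived overload. A rough sketch, assuming the same google-genai client and treating the all-None response like a transient failure:

# Retry sketch: back off exponentially, treating empty responses like 500s.
import random
import time

def generate_with_retry(call, max_attempts=4, base_delay=2.0):
    """call is a zero-argument wrapper around client.models.generate_content."""
    last = None
    for attempt in range(1, max_attempts + 1):
        try:
            response = call()
        except Exception as exc:  # server errors surface as exceptions in the SDK
            print(f"attempt {attempt}: error {exc!r}")
        else:
            if response.candidates and response.text:
                return response
            print(f"attempt {attempt}: empty response")
            last = response
        time.sleep(base_delay * (2 ** (attempt - 1)) + random.uniform(0, 1))
    return last  # may still be None/empty; the caller decides what to do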
Yeah. The Colab is copy and paste from the Python application. Also, another issue that seems to be new: I am now encountering a few requests that never return. I use PyCharm as my dev environment, and when I hit pause and then stop (i.e. SIGINT), it always stops on return self._sslobj.read(len, buffer) in this function in ssl.py:
def read(self, len=1024, buffer=None):
    """Read up to LEN bytes and return them.
    Return zero-length string on EOF."""
    self._checkClosed()
    if self._sslobj is None:
        raise ValueError("Read on closed or unwrapped SSL socket.")
    try:
        if buffer is not None:
            return self._sslobj.read(len, buffer)
        else:
            return self._sslobj.read(len)
    except SSLError as x:
        if x.args[0] == SSL_ERROR_EOF and self.suppress_ragged_eofs:
            if buffer is not None:
                return 0
            else:
                return b''
        else:
            raise
When I switch back to 2.0-flash, no issues. I am going to try running with 2.5 Flash Preview next.
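For the requests that never return (the read that blocks forever in ssl.py), a client-side timeout at least turns the hang into an error you can retry. A minimal sketch, assuming the google-genai SDK's HttpOptions takes a timeout in milliseconds; check the field name and units against your installed version:

# Client-side timeout so a hung request raises instead of blocking forever.
from google import genai
from google.genai import types

client = genai.Client(
    http_options=types.HttpOptions(timeout=120_000),  # 120 s per request (milliseconds assumed)
)

# A request that exceeds the deadline should now raise a timeout error from
# the underlying HTTP client instead of hanging on the SSL read.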
I’m also facing the same issue with Gemini 2.5 Pro and Flash. The issue does not occur constantly and is very random.
I’m working with large prompts (unfortunately I can’t share them), and sometimes the API runs for a long time; it seems like the model generates a long output, but the response is empty at the end!
Is anyone else getting billed for these calls? It is hard to tell. The response does include the total tokens from the system prompt + user prompt, which can add up.
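One way to tell is to log the usage metadata on every response, including the empty ones. A small sketch, assuming the google-genai response exposes usage_metadata with prompt/candidates/total token counts (field names may differ between SDK versions):

# Log token usage per call so billing for empty responses becomes visible.
def log_usage(response, label=""):
    usage = getattr(response, "usage_metadata", None)
    if usage is None:
        print(f"{label}: no usage metadata on response")
        return
    print(f"{label}: prompt={usage.prompt_token_count} "
          f"candidates={usage.candidates_token_count} "
          f"total={usage.total_token_count}")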
I’m seeing the exact same issues here. Once it returns None, it will consistently return that every time I call it via the Python packages. Oddly enough, if I run the exact same prompt via the AI Studio UI, it still works fine. My specific prompts are <1k text and ~4k in document tokens, if that helps.
More observations: the bug seems to be in the Python SDK. When I run the same system prompt and user prompt with the same Python code (literally copy and paste) in Colab, it works 100% of the time. After making some adjustments to my system prompt in Colab (again running it back to back with no issues), I copied it back into my Python code and it has now failed 5 times in a row.
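Before blaming the SDK outright, it may be worth confirming both environments are running the same SDK and HTTP stack; Colab often has different versions preinstalled. A small sketch using only the standard library (the package names are the common ones, adjust if your install differs):

# Run in both Colab and the local app to compare runtime and SDK versions.
import sys
from importlib.metadata import PackageNotFoundError, version

print("python:", sys.version)
for pkg in ("google-genai", "google-generativeai", "httpx", "requests"):
    try:
        print(f"{pkg}: {version(pkg)}")
    except PackageNotFoundError:
        print(f"{pkg}: not installed")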
Bro, it might be because the grounded search hit a page running an anti-AI trap that slowly generates random text and eventually links back into another loop, so the model just wastes resources until it times out. These traps have a name that I can't recall right now; they exist as a response to crawlers ignoring robots.txt.