I have been using the gemini-1.5-flash-001 model (free tier) to extract information from scientific publications: a prompt describing the task (what to extract and how to format the reply) is appended to the plain text of each paper. Most of these have worked well, but 5 out of 30 return the following:
response:
GenerateContentResponse(
    done=True,
    iterator=None,
    result=glm.GenerateContentResponse({
        "candidates": [
            {
                "finish_reason": 4,
                "index": 0,
                "safety_ratings": [],
                "token_count": 0,
                "grounding_attributions": []
            }
        ]
    }),
)
All of the documentation says the finish reasons come from this list, but the entries are given by name only, not by number. It is not clear which of them the "4" represents (or even whether that is the right way to interpret the field!).
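For reference, if the underlying proto enum simply numbers the finish reasons in the order the docs list them, starting from 0 (that ordering is an assumption on my part, not something I have verified against the proto definition), the mapping would look like this sketch:

```python
from enum import IntEnum

# Assumed mapping: the finish-reason names from the docs, numbered in the
# order they are listed there (not confirmed against the actual proto).
class FinishReason(IntEnum):
    FINISH_REASON_UNSPECIFIED = 0
    STOP = 1
    MAX_TOKENS = 2
    SAFETY = 3
    RECITATION = 4
    OTHER = 5

# Translate the raw integer from the response dump into a name.
print(FinishReason(4).name)  # prints "RECITATION"
```

If that ordering holds, "4" would mean the recitation check fired, but I would like confirmation rather than guessing from the list order.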
Also, just to be clear: I was hitting the API well below the published limits, and the errors persisted on another day when I retried several of these files individually, so I don't think I broke the rate limit.
The call was (imports shown for completeness):
import google.generativeai as genai
from google.generativeai.types import HarmCategory, HarmBlockThreshold

model = genai.GenerativeModel(
    model_name="gemini-1.5-flash-001",
    system_instruction=system_prompt_text)

temp = 0.2
max_reply_tokens = 4096
model_config = {
    "temperature": temp,
    "max_output_tokens": max_reply_tokens,
    "response_mime_type": "application/json",
}

completion = model.generate_content(
    submit_prompt,
    generation_config=model_config,
    safety_settings={
        HarmCategory.HARM_CATEGORY_DANGEROUS_CONTENT: HarmBlockThreshold.BLOCK_ONLY_HIGH
    },
)
Note that the safety settings are there because some of the scientific papers discuss drug abuse, and a few replies were coming back with "medium" dangerous-content warnings.
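In the meantime I guard against candidates that finished abnormally before touching the reply text. This is a minimal sketch against a stub object; the `safe_text` helper, the assumed STOP value of 1, and the `text` attribute are all illustrative, not the SDK's actual API:

```python
from types import SimpleNamespace

STOP = 1  # assumed numeric value of the STOP finish reason (unverified)

def safe_text(response):
    # Return the candidate text only when generation finished normally;
    # otherwise return None instead of raising on a content-less candidate.
    cand = response.candidates[0]
    if cand.finish_reason != STOP:
        return None
    return cand.text  # hypothetical attribute, for illustration only

# Stub mimicking the failing response shape shown above:
stub = SimpleNamespace(candidates=[SimpleNamespace(finish_reason=4)])
print(safe_text(stub))  # prints "None"
```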
Although it would take up a lot of space, I could upload an example prompt/paper. Each is on the order of 6,000-8,000 tokens, so not gigantic. The prompt asks for a JSON response, and for the other papers this worked well.
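Since the prompt requests JSON, I also parse the replies defensively so that an empty or truncated reply fails loudly rather than silently; a small sketch (the `parse_reply` name is mine, just for illustration):

```python
import json

def parse_reply(reply_text):
    # The prompt asks the model for application/json; guard the parse so a
    # truncated or refused reply surfaces as a clear error.
    try:
        return json.loads(reply_text)
    except json.JSONDecodeError as exc:
        raise ValueError(f"model reply was not valid JSON: {exc}") from exc
```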
But I really just want to know how to translate this to the actual finish reasons. Thanks if you can help!