Answer Multiplication

Vovaldi · August 4, 2024, 5:25pm

I use Gemini 1.5 Flash in my functions.

It is a simple function that just generates a response to a text query. If I run the function directly from its own file, everything works correctly.
But if I import it and use it somewhere else, like:

from some_file import some_func

then the response comes back incorrect. The system duplicates the last message until it runs out of response length.

Also, the quality of the response itself gets worse. It’s like the model is getting a level dumber.

The model settings:

generation_config = {
    "temperature": 0.35,
    "top_p": 0.9,
    "top_k": 40,
    "max_output_tokens": 2048,
    "response_mime_type": "text/plain",
}
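
For reference, here is a minimal sketch of how settings like these are usually passed to the model with the google-generativeai Python SDK. The name some_func is taken from the import line above; the GEMINI_API_KEY environment variable and the choice to build the model inside the function are assumptions for illustration, not the original code. Building the model per call keeps the function free of module-level state (such as a shared chat session), which is one thing worth ruling out when behaviour differs between a direct run and an import.

```python
import os

# Settings copied from the post above.
GENERATION_CONFIG = {
    "temperature": 0.35,
    "top_p": 0.9,
    "top_k": 40,
    "max_output_tokens": 2048,
    "response_mime_type": "text/plain",
}


def some_func(prompt: str) -> str:
    """Send a single, stateless text query and return the reply text."""
    # Imported lazily so this module can be imported without the SDK installed.
    import google.generativeai as genai

    # Assumption: the API key is stored in an environment variable.
    genai.configure(api_key=os.environ["GEMINI_API_KEY"])

    # A fresh model object per call, so no history carries over between calls.
    model = genai.GenerativeModel(
        model_name="gemini-1.5-flash",
        generation_config=GENERATION_CONFIG,
    )
    response = model.generate_content(prompt)
    return response.text
```

If the real function instead keeps a chat session or a message list at module level, comparing its behaviour against a stateless version like this one may help narrow down where the repeated output comes from.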

Has anyone had this problem?

GUNAND_MAYANGLAMBAM · August 28, 2024, 11:17am

Hi @Vovaldi, I tried replicating your issue by defining two modules, “file1.py” and “file2.py”, where “file1.py” contained the function that generates text and “file2.py” used the function defined in “file1.py”. I observed that the generated response was perfectly fine. For your reference, I’m attaching screenshots of the code.
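
The two-module layout described above can be sketched roughly as follows. This is not the code from the screenshots: generate_text, run and build_and_run are hypothetical names, and the model call is replaced by a deterministic stand-in so the import path can be checked without an API key. The script writes both modules to a temporary directory and then calls the function through “file2.py”.

```python
import importlib
import sys
import tempfile
import textwrap
from pathlib import Path

# file1.py: defines the text-generating function (stand-in, no API call).
FILE1 = textwrap.dedent('''
    def generate_text(prompt):
        # Stand-in for the real model call; returns a deterministic string.
        return f"echo: {prompt}"

    if __name__ == "__main__":
        # Runs only when file1.py is executed directly, not when imported.
        print(generate_text("hello"))
''')

# file2.py: imports the function from file1.py and uses it.
FILE2 = textwrap.dedent('''
    from file1 import generate_text

    def run(prompt):
        return generate_text(prompt)
''')


def build_and_run(prompt: str) -> str:
    """Write both modules to a temp directory and call file1's function via file2."""
    with tempfile.TemporaryDirectory() as tmp:
        Path(tmp, "file1.py").write_text(FILE1)
        Path(tmp, "file2.py").write_text(FILE2)
        sys.path.insert(0, tmp)
        importlib.invalidate_caches()
        try:
            file2 = importlib.import_module("file2")
            return file2.run(prompt)
        finally:
            # Clean up so repeated calls start from a fresh import.
            sys.path.remove(tmp)
            sys.modules.pop("file1", None)
            sys.modules.pop("file2", None)
```

Calling build_and_run("hello") and comparing the result with a direct call to the same function is one way to confirm that the import itself does not change the output.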

Hope it helps…
