To help me understand your question better, could you please share the exact prompt you’re using? Also, could you clarify if you’re providing image to Gemini or a text based mathematical equation?
Isn’t the issue the fact that it failed to see the image? It says as much in the thinking: “I’ve hit a slight snag, as I cannot actually see the image.” - so from that point on, it’s just making it up entirely. I don’t believe this is memory pollution; I think it’s just trying to be overly helpful instead of coming back and saying there was an error parsing the image data - which is what it should have done, but didn’t.
For context, I just tried your image and it worked fine:
You are right, this probably because the model cant see the image, I already repeated the prompt again and again, it finally can read the Integral problem
I apologize for identifying wrong subject, I primarily using this question as an benchmark first, but it seems the flaw is not the model itself, but its inability to read the image and trying to be helpful, its actually good but disastrous toward user that think this is hallucinating too much.
You are right to point that in the thinking its point its inability to read image..
No, you’ve definitely found what I would consider to be a “bug” in Gemini - if it can’t read the image, it should just bail out right there and not waste your API tokens!
I think you’ve found an error for sure, it just wasn’t memory-related. It’s strange because if an image fails to upload, you get an error from the API then. And if it fails to read the image, you get an error from that too (like if the image is too large or corrupted) - but you’ve found something else entirely, where the image uploaded just fine but got lost between the upload and the model thinking about it. Which is quite a fascinating bug!
I tried to reproduce the issue using the information you provided and in my case the model was able to understand the problem.
It looks like the model might be hallucinating in your case, but we can usually fix this by giving it a bit more context. When you’re passing an image to the model, try adding some additional information about that image in your prompt. This extra detail should help the model understand better and give better results.