Critically poor performance of the latest gemini-2.5 model

Dawid_M · April 7, 2025, 10:13am

The model, despite instructions during code creation, exceptionally violates user guidelines and ruins code.

Even when forbidden from operating on specific code sections, the model will still completely change the forbidden sections when providing code.
Ignoring user guidelines and system instructions. Example: the model is supposed to fix a few things in my chatbot, but gemini focuses mostly not on the task but on changing the model used in the chatbot and rearranging parameters that the user had established.
Completely ignoring forbidden things. Example. I forbid using libraries X and Y, and instead suggest Z, the model will ignore this anyway and mindlessly use X or Y.

As a joke, I made system instructions with 100,000 tokens, where there was just one sentence copied hundreds of times saying ‘You are forbidden from X and Y’, and the model ignored it during operation anyway!

I feel that something went wrong. The model cares even less about system instructions and user guidelines than previous versions.

Karl_Ernst · April 11, 2025, 1:38am

This is not directly code related, but it is similar behavior.

Gemini 2.5 is driving me absolutely batty with its footnote behavior. I have a page and a half of instructions. Two chunks:

How do footnotes
How to verify them

It does neither consistently. Of course, it does do some things spectacularly well… like the research and writing I’m having it do. I still have to edit, of course… I just wish it would follow the basics.

Gerald_Fehringer · April 14, 2025, 5:49am

HI, I’ve similar results.

Overall quality in architect-mode is good, but the model has severe issues with editing larger files and I thought moving away from Claude 3.7 Thinking, because issue with context -window limits.

I’d not recommend to use 2.5 Pro exclusively and combine it with Claude. Also it gets quite expensive, especially if it has loop-issues/loosing context Just spent €90, as it kept wrong editing and i didn’t pay attention (expensive lunch-break )

In overall, I really like 2.5 Pro, because the 1m token window are what I sometimes need, but after the poor performance on API side (yes, also paid tier has over the day extreme issues/timeouts) I need to move on and build my own context-memory with better performing models.

Have fun & test!
Gerald

zara · April 14, 2025, 10:36am

Yeah, the performance became laughable ever since they replaced the Experimental model with the Preview.

Dawid_M · April 14, 2025, 10:48am

I’m noticing very strange behavior. Now ADK has been released. I thought great, I’ll be able to create amazing workflows and agent networks using 2.5. The result? I’ve never been so angry at Gemini. Instead of using ADK properly (I give it formatted information from the website and GitHub, provide examples), the 2.5 model still mixes up libraries, randomly throws in generativeai, inserts something like gemini-pro-vision. And I almost forgot to mention it makes 5 comments about ‘temperature: 0.7 #to add creativity this is very important very very’. I feel like the model has completely started ignoring system instructions, user input, tasks, and information, and instead works like one big random machine designed to irritate users.

Tim_M · July 6, 2025, 4:24pm

Having the same issue. I ask for specific updates to certain code but seems to get stuck in a loop no matter what I tell it. Usually becomes unusable after 4-5 hours of work. Seems to become “stupid” after that, but the next day it performs as it should, following instructions - becoming “smart” again. Memory retention issue?

Topic		Replies	Views
Gemini 2.5 Pro Preview is very bad! Google AI Studio api , models	25	3735	May 29, 2025
Gemini 2.5 Pro has gotten worse Google AI Studio models , model , gemini-2-5	15	738	July 24, 2025
Gemini 2.5-pro-preview-06-05 extremely slow Google AI Studio feedback , gemini-2-5	4	622	June 30, 2025
Gemini 2.5 Pro Preview 05-06 is Severely Underperforming Google AI Studio models , issues	1	889	May 19, 2025
Gemini 2.5 Pro's Response Quality Regression Google AI Studio models , gemini-25 , gemini-2-5	6	553	July 6, 2025

Critically poor performance of the latest gemini-2.5 model

Related topics