Gemini 3 significantly worse than 2.5 Pro at long context. Temperature likely to blame

I 100% agree here. For the first week or two, Gemini 3.0 seemed leaps and bounds above 2.5. It was able to maintain long conversations, talk intelligently, and recall items from much earlier in the conversation, and I felt it was a huge upgrade over 2.5. Unfortunately, over the last week or so I feel like the bottom has dropped out, and I can’t trust or use Gemini for any long-term projects. Some items I’m seeing:

  1. Chats will completely lose context and seem to forget earlier portions of the conversation relatively quickly. You can scroll back, see the prompt/response in question, and Gemini will still claim ignorance of the information. (See this thread; this is exactly what I’m seeing.)

  2. Chats will randomly prune information from very early in the conversation out of the context window. Even though you can still see the prompt/response in myactivity.google.com, if you scroll back up in the chat, they are completely gone. If you ask Gemini about the earlier prompts, it claims it doesn’t know what you’re talking about and that they’re not in its context window.

  3. 30-40% of the time, if you add an attachment and ask for some sort of analysis of that document/image/etc., the model reports back but analyzes a previous attachment from earlier in the conversation. When you try to correct it (“You’re analyzing the wrong image, please look at the last attachment from my previous prompt.”), half the time it will analyze a different attachment from earlier in the conversation.

  4. In general, the model just seems ‘confused’ more often than not, responding with answers that have no relationship to what you asked. As an example, I recently asked it to help me dial in some settings in the XSplit Vcam software, and it kept trying to tell me what settings to change in the XSplit Broadcaster software (they are two different apps).

I’m seeing this in both Fast and Thinking modes. I honestly loved 3.0 when it first came out and thought, “This is it…this will be my tool going forward…” But it sounds like there was some large update around 12/4, and I really think something got majorly borked on the backend, because it’s been a mess for me since then. Unfortunately, I don’t see any way to roll back to 2.5, which would at least be better than what I’m getting from 3.0. Since I can’t trust the tool for anything but the most basic tasks, I’m now actively looking at other models.
