The current mode 06/05 is significant improvement after 05/06 version, However after doing a longer chat, this model will stop thinking more and more frequently even if thinking and budget enable, as we know if gemini 2.5 stop thinking, the quality will decline a lot.
Hey @Yingrong_Lin, Thanks for sharing your experience. Could you estimate the token count at which you start to notice this drop-off in thinking
despite having thinking and budget enabled?
[Hello, the non-thinking will gradually appear when token amount surpass 50-100K for conversation, which only appear for chat, but if archive conversation into txt file it will not appear
I have a long chat window with 830,000 tokens, with the same issue.
To be honest those pro models have been showing this very same issue, we had this with 05-06 too. I mean like, sometimes they actually decide to think or not to think. And even if you explicitly tell them to think or use thinking mode, they ignore it(sometimes it works).
As reasoning models, I think that they should either be forced to think for each response or we should simply have a thinking button(like we have in flash models).
This is why AI studio really has to be unlimited, because many times you are going to be resetting the same question until the AI starts to think
Yes, This version is stronger, but without thinking the dip of quality is obvious to see, which the thinking process likely a double iteration for answer
Is this a feature or a bug?