Gemini 2.5-pro has become unusable. When 03-25 was released, it could summarize a long context in under 3 minutes. After the update to 05-06, that time went up to 8 minutes, with very frequent timeouts at the 10-minute mark. Now 06-05 never finishes the same task, even with the thinking budget set to the minimum.
Hi Joao_Lazzaro,
With new models being released and old models being deprecated, we have seen a large increase in daily active users, which could be one of the reasons for the higher processing times.
Can you please provide some of the long-context prompts that were used to test the model, along with the processing times? That would help us compare the results on our end and rule out other causes of the slowdown.
I cannot share the prompt, since it consists mostly of proprietary documents (around 190k tokens), a short system prompt in the style of “You are a helpful assistant”, and a user question. I tried 06-05 again yesterday with no success, so we’re still testing with 05-06.