Very slow response time on the new 2.5 Pro 0605 model

Emir_Arditi · June 6, 2025, 11:58am

We have been experiencing significantly slower response times on the newly released 2.5 pro 0605 model. For comparison, on our testbed with structured output tasks, gemini 0506 has an average response time of 15.4 seconds, while 2.5 pro 0605 exceeds our 180 second timeout limit 28/30 times. We have also tried to reduce this by lowering the thinking budget to 2048 on 0605, but even on this trial, 26/30 trials exceeded our 180 second limit. We are using langchain google genai library with the following library versions:

langchain: 0.3.24
langchain-google-genai: 2.1.4
google-genai: 1.16.1

does anyone face a similar problem? Do you have any suggestions/solutions?

Krish_Varnakavi1 · June 6, 2025, 9:04pm

Firstly Welcome to the Google AI Forum!

Thank you for reaching out and providing detailed information about your experience with the Gemini 2.5 Pro 0605 model.

We understand that you’re encountering frequent timeouts, even after adjusting the thinking budget. This issue has been noted by other users as well, particularly with large prompts or complex tasks. While the 0605 model introduces enhancements in reasoning and coding, it may require more processing time, especially for intricate requests.

To mitigate this:
Optimize Prompt Design: Break down large tasks into smaller, more manageable prompts to reduce processing time.
Monitor API Usage: Keep an eye on your API usage to ensure you’re within the recommended limits and avoid potential throttling.

If the issue persists, please provide additional details about your implementation, such as the specific tasks you’re performing and any error messages received. This information will help us assist you further.

We appreciate your patience and understanding as we work to improve the Gemini 2.5 Pro model.

David_Wiles · June 13, 2025, 4:27pm

Two days ago I migrated one of my “creative writing assistant” prompts over to Gemini 2.5 Pro Preview from the older 05-06 mdel, and certainly noted a slowdown in response with errors in Google Chrome [This page is slowing down Chrome [Wait] [Stop]] which I had to press [Wait 2-4 times before I got results - after waiting 30-80 seconds. Today however on the 13th, the response is a lot faster and I am getting not errors, the model also seems to be doing better with the task. Just an observation. I hope it continues!

Guven_Candogan · June 15, 2025, 10:27am

We have the same issue.
Recently switched to Gemini 2.5 and experiencing significant response delays

Krish_Varnakavi1 · June 27, 2025, 10:37pm

Hi @Guven_Candogan,

We have experienced significant traffic which caused delays in response.. Are you still facing this issue?

Topic		Replies	Views
Gemini 2.5-pro-preview-06-05 extremely slow Google AI Studio feedback , gemini-2-5	4	1120	June 30, 2025
Gemini-2.5-pro accessed over https://generativelanguage.googleapis.com/v1beta/openai/ has dramatic latency increase Gemini API api , model , gemini-2-5	10	1182	July 21, 2025
Gemini 3 Pro does not responds or responds very slow Gemini API models , gemini , gemini-3	30	4344	April 25, 2026
GenAi Apis is taking much time in generating response Gemini API gemini-2-5 , genai	1	366	July 29, 2025
Streaming API is too slow Gemini API prompt , generative-ai	2	280	January 13, 2026

Very slow response time on the new 2.5 Pro 0605 model

Related topics