I’ve been using gemini 2.5 pro on the Gemini developer API for few weeks now.
While I acknowledge the fact it’s in preview, it’s almost not possible to even evaluate it, since in almost every session I try to use it with multi-modal prompting (images + text), it just returns a 5XX error.
Am I the only one who is experiencing this?
When it works, its results are good, the latency’s fine, and it seems great.
But currently, it’s barely testable as you get 500 errors constantly.
Hi @Yovel_Cohen, Welcome to forum!!
Thanks for the feedback. Sometimes we see 5xx error due to server side error or migration or overload issue. This is an intermittent issue which will resolve by itself. This will be mitigated once it becomes a stable model.
I recommend Temporarily switch to another model like 2.5 flash.
Hi @Govind_Keshari, thanks for your reply.
It’s not only that 5xx errors are almost the norm for the model, but also empty responses, as stated here:
Makes me wonder when and if I’ll be able to test the model, on the one hand Google’s pushing it, including through it’s sales. On the other, I can’t even test it.
I don’t think using another model like 2.5 flash is the answer, because they don’t serve the same purpose…
true, I also facing same issue for long time
Hey @Yovel_Cohen, Are you also using grounding by any chance or any other tool??
Hey @Yasar_Arafath, I will check on this and get back to you.
no, I use structured output schema, but it’s still none even if I remove it from the request.
my input includes text + images
Thanks for confirming, I will check on this.
Hey @Yasar_Arafath, This is a different case, if the output token count is more than max output token count then we see empty response.
Our team is working on 5xx error, meanwhile if you can share your code if possible so i can repro the issue.
I will have to agree with OP here. I’ve been trying to create a coding CLI agent for Gemini, put it’s been tough going to be honest..  500 errors ever so often, and on top of that the model does not produce any ouput at times and just stops, and it just plains refuses to actually format strings correctly for tool calls. Well, I’m just frustrated at this point, but I’ll jack in Claude to see how that handles for comparison
  500 errors ever so often, and on top of that the model does not produce any ouput at times and just stops, and it just plains refuses to actually format strings correctly for tool calls. Well, I’m just frustrated at this point, but I’ll jack in Claude to see how that handles for comparison
IF Gemini 2.5 outputs at all, it is absolutely great!. This “issue” truly is a showstopper to use it in production. Please resolve it asap, I love to use this in production.
Hey, gemini-2.5-pro is now a GA stable model, so this issue has been resolved.
Thanks
Sorry @GUNAND_MAYANGLAMBAM but this simply is not true. I moved over to GA as soon as it arrived and still this issue persisted every day. This is not strange:
Note: Effective June 19, 2025, the gemini-2.5-pro-preview-05-06 endpoint will switch to serving the Gemini 2.5 Pro 06-05 version. This version of the model is the same as the version released to general availability.
Source: Gemini 2.5 Pro | Generative AI on Vertex AI | Google Cloud
So the GA version is the Gemini 2.5 Pro 06-05 version which still served this issue.
I tried vibe coding. It starts out really well, and aftter say 30+ turns, it is a mess. It is almost impossible to use the product. It loses its context, repeats itself, and difficult to move forward. I do not think it is ready for prime time atlease for coding.
Hi @Jagadish_Channagiri. This has nothing to do with Gemini. This is called the “Context Degredation Syndrome”. Every LLM has this “undocumented feature”… This is a great article about the subject: jameshoward .us 2024/11/26/context degradation syndrome-when-large-language-models-lose-the-plot
I understand. My only issue there is no way to reset the context. If I could “zero” or reset the context, I can focus on specific issues and resolve them. Thanks and Appreciate your answer @Rodger_Blom