I am trying to use it to perform some processing of data, but it is highly unstable, i have retries and even with 3 retries it keeps failing, is there any instability? Also why are you guys deprecating it, if the costs of the next flash lite model are significantly higher than 2.5. Very disappointed.
I’m not even processing them in parellel they are running sequentially, this shouldn’t be happening
Same here is happening to me. I’m always below 40% success rate in the last days.
Same thing happening here, the last couple days it has gotten even worse
Hello all,
503 errors are due to our services being temporarily overloaded. Please see this pinned post for suggestions on how to handle this.
This answer is completely ridiculous, the models is not “temporarily overloaded”, how do you explain that yesterday and today i am still getting these errors, there very obviously is some problem here, this is completely unacceptable, we have implemented retries already, we attempt it 20 TIMES and it still manages to fail 20 times. And im sorry but refactor our whole flow just to use batch operations is a joke of an response, like yeah sorry we cant provide stability so please refactor everything you’ve done so that it runs in a more complicated and tricker way so that we can handle the load.
and when i check gemini status it says there is nothing wrong.