That means I’m paying for the input tokens as normal and the request is valid, but Gemini is just being lazy or has an API bug, and I don’t get my results.
Now, I can try messing with the input data a bit or the temperature/seed and retry, but I’m still paying for something I didn’t get.
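For anyone wanting to automate that retry, here is a minimal sketch. It assumes a hypothetical `call_model(prompt, temperature, seed)` function that you write yourself around whatever client you use, returning the response text (possibly empty or `None`) and the number of input tokens billed. None of these names come from the Gemini SDK. It also counts the input tokens spent on empty attempts, so you can at least measure what you paid for nothing.

```python
import random
from typing import Callable, Optional


def generate_with_retry(
    call_model: Callable[[str, float, int], tuple[Optional[str], int]],
    prompt: str,
    max_attempts: int = 3,
    base_temperature: float = 0.2,
) -> tuple[Optional[str], int, int]:
    """Retry on empty responses, varying temperature and seed each time.

    `call_model` is a hypothetical wrapper around your own API client.
    Returns (text or None, attempts made, input tokens billed on empty attempts).
    """
    wasted_input_tokens = 0
    for attempt in range(1, max_attempts + 1):
        # Nudge the temperature up a little on each retry, capped at 1.0.
        temperature = min(base_temperature + 0.2 * (attempt - 1), 1.0)
        # Use a fresh seed so a retry is not a replay of the failed request.
        seed = random.randrange(2**31)
        text, input_tokens = call_model(prompt, temperature, seed)
        if text and text.strip():
            return text, attempt, wasted_input_tokens
        # Empty result: the input tokens were still billed.
        wasted_input_tokens += input_tokens
    return None, max_attempts, wasted_input_tokens
```

This only works around the symptom. Every empty attempt is still billed, which is the actual complaint.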
Right now, for this feature, it happens on about 1 in 8 requests. At production scale, that’s a lot of wasted money.
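To put a rough number on that, here is a back-of-the-envelope sketch. The request volume, tokens per request, and price below are placeholders I made up, not real figures or real Gemini pricing, and the retry estimate assumes each attempt fails independently at the same rate.

```python
def wasted_input_cost(
    requests: int,
    empty_rate: float,
    input_tokens_per_request: int,
    price_per_million_input_tokens: float,
) -> float:
    """Input-token spend on requests that came back empty."""
    empty_requests = requests * empty_rate
    return (
        empty_requests
        * input_tokens_per_request
        * price_per_million_input_tokens
        / 1_000_000
    )


def expected_attempts_per_success(empty_rate: float) -> float:
    """Average attempts needed per usable answer, assuming independent failures."""
    return 1 / (1 - empty_rate)


if __name__ == "__main__":
    # All values below are placeholders; substitute your own volume and pricing.
    cost = wasted_input_cost(
        requests=1_000_000,
        empty_rate=1 / 8,
        input_tokens_per_request=2_000,
        price_per_million_input_tokens=0.10,
    )
    print(f"Wasted input spend: {cost:.2f}")
    print(f"Attempts per success: {expected_attempts_per_success(1 / 8):.3f}")
```

At a 1 in 8 empty rate with retries, that works out to 8/7 attempts per usable answer, so roughly 14% extra input cost on top of the normal bill.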
Both, mostly with complex ones. But if Gemini is unable to answer a query, it should say so, not have the API act as if everything’s fine and also bill me.
Thanks. Now it’s happening more and more, also on 2.5 lite.
These are production services, and these are supposed to be production-ready models.
We can’t have that happening.
This started happening to me yesterday as well. Previously, every complex task worked 100% of the time, but now almost any task, regardless of complexity, stops immediately. I think we can prompt the model to disable the insta-stop feature.