Anyone experiencing google.api_core.exceptions.Cancelled: 499 The operation was cancelled

Same, very frequent 499 occurrences on the 2.0 Flash multimodal API!

Around here I implemented a retry method for errors 499, but sometimes I get the 503. I wanted to know if they have any relationship.

We also started getting a mix of 499s and 503 (service unavailable errors). Google support relayed an answer from the product team saying that the 499 means the server is exhausted and it should be treated as a 429 error. I’m surprised that this happened so suddenly without us increasing throughput. We’re already sending requests across regions.

@Logan_Kilpatrick can you confirm that this is the case and we should expect that this error rate is normal?

Thank you @Logan_Kilpatrick - Any update on this? I keep randomly receiving it.

Hey folks, Team is actively working on it. Will be fixed soon.

2 Likes

How can a bug like this stay open for 14 days. We don’t need a new model every other week so you can say ā€œwe beat the benchmarksā€.

We need stable models. I now started receiving empty answers, no code changes. It’s just unusable.

1 Like