Anyone experiencing google.api_core.exceptions.Cancelled: 499 The operation was cancelled

Same, very frequent 499 occurrences on the 2.0 Flash multimodal API!

Around here I implemented a retry method for errors 499, but sometimes I get the 503. I wanted to know if they have any relationship.

We also started getting a mix of 499s and 503 (service unavailable errors). Google support relayed an answer from the product team saying that the 499 means the server is exhausted and it should be treated as a 429 error. I’m surprised that this happened so suddenly without us increasing throughput. We’re already sending requests across regions.

@Logan_Kilpatrick can you confirm that this is the case and we should expect that this error rate is normal?

Thank you @Logan_Kilpatrick - Any update on this? I keep randomly receiving it.

Hey folks, Team is actively working on it. Will be fixed soon.

2 Likes

How can a bug like this stay open for 14 days. We don’t need a new model every other week so you can say “we beat the benchmarks”.

We need stable models. I now started receiving empty answers, no code changes. It’s just unusable.

1 Like

Hey guys, I’m running an experiment for a school paper and I’ve included Gemini models via API. In total I needed to generate 128 results with Gemini 3 Pro and 128 results with Gemini 3 Flash. I’m on tier 1 billing and was able to generate most of the responses throughout yesterday afternoon but at one point yesterday the API started responding with either 499 operation cancelled or 504 deadline exceeded. It was happening intermittently with repeated request usually getting a proper response but now with only 16 results needed remaining I am not able to finish the data set.

Naturally during testing of my script and tweaking the variables needed for the experiment the total number of requests was much higher than 256 but I checked the dashboard and I have not reached any of the limits, neither RPM/TPM/RPD (alhough I am close with 225/250 on RPD on gemini 3 pro).

As I am writing this, I’m trying 4 remaining results for gemini 3 pro and only 1 came back successful, 2 returned 504 and 1 returned 499. With Gemini 3 Flash I am not able to get a successful response at all. The timeout is set to 1 minute, is there something I can do from my end?