503 - with gemini Priority inference

Hey,

Google released few weeks ago and solution for the 503 issue, where if you pay twice the price they will put you in a fast lane and at peak times you should have access to the service

We allowed for priority as 503 is a big problem for us, but it just doesn’t work.

It has never resolved the 503 issues and the api is set up properly

Git issue here:

Google should separate the API pay method and subscription plan hardware pool. API pay much more and need uninterrupted work flow.

Hello @All,
Can you share your project ID via DM?