Hi everyone, I made a Flutter App with Vertex AI, and I constantly get a quota error ERROR Quota exceeded for aiplatform.googleapis.com/generate_content_requests_per_minute_per_project_per_base_model
This is happening when using Gemini Pro or Flash models.
My Google Cloud account is configured with Billing correctly, and when I go to the quotas section, the quota value for the generate_content_requests_per_minute_per_project_per_base_model
service is at 1
as a maximum, and to ask for more quota they tell me that I have to contact the sales team, which I have already contacted repeatedly but I never receive a response.
I saw this happen to many people (I saw posts on Stackoverflow, Reddit) and I can’t find any solution.
I hope someone on the Google team will look into this.
Best.
Maybe you are giving more req to the model per minute there is a limit of message that you can send the req to the model i think you have exceed that limit that why this error is coming or make a new api maybe this will fix your problem.
I don’t think that’s it, it’s just me using the app, and the interaction is similar to a chat. I don’t know if the Vertex SDK is using the correct service or if it’s a quota issue.
use ai studio that will help you
1 Like
I have the same problem.
The quota is limited to 1 RPM instead of 60, I am on a pay-as-you-go account. Can’t find any solution for a long time.
Yes, I don’t know where to look anymore. The most annoying thing is that I already sent the contact to the sales team “to request a more quote” and I never received a response.
Indeed. I have written four letters to the sales team and never got any reply from them. And I have surfed the Internet and found many posts with the same issue description, but no solution so far. If you find any, please let me know. If I find anything, I will post here.
1 Like