Fought this for 4 weeks. Went through billing support, they sent me all around. Finally got referred to sales.
Here is the TLDR. Contact your Project Rep. Schedule a meeting. Sign up with a partner. I think this is google’s way of ensuring they get your money. They aren’t going to increase your limits(and effectively what they are letting you “spend”, without knowing you are good for the bill)
They recommended I sign with one of their google cloud premier partners. I did this, and SAME day was granted a custom limit on Vertex and resolved my issues. Still hit 429s because google is clearly having internal problems and not telling us.
Gemini 3 Pro is screwed. 2.5 pro is the only way I don’t get rate limited currently. Hopefully they announce something soon but they seem to be silent about this.
The worst part is they have ABSOLUTELY ZERO documentation about this.
But seriously. Contact your cloud rep and schedule a meeting. The rep can help with everything. I couldn’t even file a f*** ticket with support. Still can’t. But I can email the rep/now my partner on this and get stuff sorted so much quicker.
I fought this for a month. Finally have some sort of a path forward.
My Understanding of why they do this: Like I said with the billing, they aren’t going to give you the moon in resources and have you not pay the bill. By going through a partner, the partner then is “on the hook” for the bill. So google gives them the ability to give out tons of resources. I’m sure there are other reasons yada yada but it comes down to money.
Hope this helps and I hope people can see this.
Tags: Vertex AI API 429 errors, Vertex AI API rate limits, Vertex AI API rate limit increase, Vertex AI rate limit increase, Resource Exhausted, Unable to request increase on base models vertex ai api, gemini 3, gemini 2.5 pro,