@Mustan_lokhand
I have the exact same request
I was using the gemini CLI today on my AI Pro plan and got throttled so hard that one request got throttled for 30 minutes
I got this snippet from the debug console there:
' "status": "RESOURCE_EXHAUSTED",\n' + β
β ' "details": [\n' + β
β ' {\n' + β
β ' "@type": "type.googleapis.com/google.rpc.ErrorInfo",\n' + β
β ' "reason": "MODEL_CAPACITY_EXHAUSTED",\n' + β
β ' "domain": "cloudcode-pa.googleapis.com",\n' + β
β ' "metadata": {\n' + β
β ' "model": "gemini-3.1-pro-preview"\n' + β
β ' }\n' + β
β ' }\n' + β
β ' ]\n' + β
β ' }\n' + β
β '}\n' + β
β ']', β
β headers: { β
β 'alt-svc': 'h3=":443"; ma=2592000,h3-29=":443"; ma=2592000', β
β 'content-length': '630', β
β 'content-type': 'application/json; charset=UTF-8', β
β date: 'Thu, 09 Apr 2026 18:50:54 GMT', β
β server: 'ESF', β
β 'server-timing': 'gfet4t7; dur=7056', β
β vary: 'Origin, X-Origin, Referer', β
β 'x-cloudaicompanion-trace-id': '65bd1c35535158af', β
β 'x-content-type-options': 'nosniff', β
β 'x-frame-options': 'SAMEORIGIN', β
β 'x-xss-protection': '0' β
β }, β
β status: 429, β
β statusText: 'Too Many Requests', β
β request: { β
β responseURL: 'https://cloudcode-pa.googleapis.com/v1internal:streamGenerateContent?alt=sse' β
β } β
β }, β
β error: undefined, β
β status: 429, β
β Symbol(gaxios-gaxios-error): '6.7.1' ββ
β } ββ
β°βββββββββ
I started at 0% usage on my quota, and got 20 API request errors with an error rate of (76.9%)
Only 195,536 tokens
most being cache reads 112,400 (65.9%)
Due to retries
I am getting throttled so hard that my plan is unusable. I love the AI pro plan stack and features, but it is coming to the point where it is ridiculous to try and use.
My next step is to not use Google AI Pro, and up my Claude plan, even though I love the Gemini features.
Please help me with this issue