Persistent 429 Error in Gemini CLI (AI Pro Subscriber, Zero Recent Usage, Antigravity Works Fine)

Joslyn_Apestoso · April 3, 2026, 3:48am

Hi team,

I am experiencing the widespread 429 “Too Many Requests” (or endless “Thinking…”) error exclusively in the Gemini CLI, and I’d like to provide my data points to help isolate this bug.

Here is my current situation:

Account Status: I am an active Google AI Pro subscriber.
Zero Recent Usage: I have not used the Gemini CLI at all in the past two weeks. It is completely impossible for my account to have genuinely exhausted its quota, token limits, or triggered any real anti-abuse thresholds.
Isolated to CLI: This issue is strictly isolated to the Gemini CLI. My Antigravity IDE is functioning perfectly without a single rate-limit interruption.

As a front-end developer, having the terminal workflow completely blocked by a phantom 429 error—while other tools on the same Pro account work flawlessly—is highly disruptive. It strongly points to an entitlement desync specific to the CLI’s OAuth gateway.

Could the team provide an ETA on when the backend routing for Pro CLI users will be resolved?

Thank you!

Saw · April 4, 2026, 9:39am

Same here. It’s been over a week since my other account subscribed to Pro and can’t use it in CLI. Very disappointing.

Joslyn_Apestoso · April 6, 2026, 2:51pm

I don’t understand why Google hasn’t given a unified reply and solution yet. There are already a lot of them in the community!

Ryan_Romero · April 8, 2026, 3:05am

Fix: Removed and re-added Code Assist subscription for my account.

I started having this issue yesterday using Gemini Code Assist License assignment through GCP workspace (where you use google signin + plus project code to auth the cli). Perfect timing as I’m currently here at HumanX co. I was worried I was getting blocked for using a modified fork of the upstream gemini-cli project.

I went to the Google Gemini desk and showed them the error. The reps acknowledged the issue as soon as I said “429 error” but didn’t give a root cause. They recommended disabling “Retry Fetch Errors” which I had already tried, but then to drop and re add the subscription for my account in gcp which resolved the issue for me.

They mentioned that weird things can happen when connecting to “preview” models, like your account’s api calls hitting the correct endpoint but then getting stuck in a stale back-end route if they’re rolling nodes……they stressed “Preview” quite a bit.

NicholasC · April 9, 2026, 12:14pm

I have the same issue. I am on the Google One AI Pro plan (with Gemini Code Assist). I normally use Claude Code but sometimes I pop over to Gemini CLI to see how it’s improved. Feels like it’s gotten way worse. I asked a simple prompt like “which of my cores are P-cores vs E-cores” (a question Claude answers in mere seconds with details on which core is what), Gemini CLI took over 3+ minutes (I just quit the request because that’s stupid long). Looking at the debug it says “Too many requests 429”. They need to fix this bug asap.

Topic		Replies	Views
Gemini CLI Requests Failing with 429 – Possible Abuse Flag? Gemini API gemini_25_pro , gemini-cli	5	406	April 15, 2026
Persistent 429 Resource Exhausted (Check Quota) Error for 20+ Hours [Google AI Pro] Gemini API api , gemini , rate-limits	2	209	March 17, 2026
Consistent 429 Error (MODEL_CAPACITY_EXHAUSTED) on Gemini CLI for Google One AI Pro Subscriber Gemini API api , gemini-cli	4	347	April 1, 2026
A 429 error when calling the Gemini-2.5-pro API despite tier 1 Gemini API gemini-api , model-code	6	812	December 22, 2025
429 Quota Exceeded with Gemini Pro API Gemini API gemini-api	26	1742	November 10, 2025

Persistent 429 Error in Gemini CLI (AI Pro Subscriber, Zero Recent Usage, Antigravity Works Fine)

Related topics