Google hasn’t just slashed quotas for Claude Opus on the Ultra plan; they’ve effectively lobotomized the model. It’s now incapable of analyzing more than a few paragraphs. Google has capped the max_thinking_length at 1,024 tokens on the server side, which essentially turns Claude into useless junk. The theoretical limit for this parameter is 128,000; you need at least 32k for any kind of adequate performance.
Here is the current state of the Ultra plan (I won’t even mention the other tiers—those are basically a scam at this point):
The switch: Documentation was stealthily updated, changing “no weekly limits” to “Highest weekly rate limits” without any announcement.
Quota Gutting: Based on my usage, quotas have been slashed by about 5x. You get 1 hour of work followed by a 4-hour cooldown.
Claude models are not just restricted; they are functionally useless for professional tasks now.
Gemini 3.1 Pro remains “dumber” than a nerfed Claude with a 1,024 thinking limit.
Support is a ghost town: Sending feedback via the app is broken, and getting a response from support is impossible.
Confirmed.
I was wondering why my Opus feels dumb.
My current max_thinking_length is 1,024 tokens. This is relatively low and is not ideal for complex thinking tasks that require deep reasoning, multi-step analysis, or working through intricate logic chains.
For context:
1,024 tokens ≈ ~750 words of internal reasoning
This is sufficient for straightforward tasks (simple edits, lookups, direct answers)
It can be limiting for tasks requiring extended chain-of-thought reasoning, such as:
Complex architectural decisions
Multi-file refactoring planning
Debugging intricate logic across multiple modules
Detailed code analysis with many interacting components
holy moly, wtf is going on Google? You guys are afraid of «The bait-and-switch» term, so you changed my initial post? How is that even possiable?
This is the screenshot of my Alfred’s clipboard history. I copied it and pasted here, on forum as is, without editing. There is clearly «The bait-and-switch»
Thank you for bringing these concerns about quota limit in AI Ultra plan to our attention. Please be assured that I have shared your feedback with our internal team for further review.
We appreciate your continued patience as we work to enhance the Antigravity experience.
just tell them to up the limit of claude again. Otherwise be expecting full refund requests and CC chargebacks. I’m basically paying 250$ for NADA now as its completely useless without opus.
glad my subscription end today been a bad journey especially for the token usage hitting the limit - most of the time can only pay for api if i need to continue to work - have fun guys
if I switch to Claude code and find out opus answers more intelligently, I don’t see why I’d stick with antigravity anymore. Claude code max x20 is $50 cheaper than AI ultra even
I’m canceling today. I’m sure Google will adjust in the future, but right now their product line is an unusable joke. It’s clear that the AI bubble is bursting, because there’s no other reason why Google would try to take so much profit from such a bad product.
I think its just a necessary step in:
A.) Evaluating where the market / consumer pricing sits for the products available
B.) Weeding out the non-paying / skeleton projects from the past / dormant API’s being exploited etc…
I have faith it’ll come good - till then you can always run something local or jump ship I guess and suspend your service