Hi Antigravity Team,
I am writing to report a critical and frustrating inconsistency regarding quota consumption for Claude models under the Google AI Ultra subscription, specifically affecting users in the UTC+8 time zone. This issue appears to have started earlier this week.
The Core Issue:
Since the beginning of this week, I have observed that initiating just a single conversation session with the Claude model (either Sonnet or Opus) within Antigravity immediately consumes approximately 60% of my total quota.
This completely defeats the purpose of the Ultra subscription. I upgraded to the Ultra tier specifically to access significantly larger capacity for heavy development workflows, not to have my entire allowance exhausted by a single chat session. If one conversation wipes out more than half of my quota, the “Ultra” label is misleading, and the service becomes practically unusable for any real-world tasks.
Comparative Observations:
-
Ultra vs. Pro Paradox: Paradoxically, the quota behavior for Claude on the Pro subscription seems far more reasonable. It is absurd that an Ultra subscriber effectively receives less usable utility per dollar than a Pro subscriber. Currently, the Ultra subscription’s Claude quota is performing worse than the Pro tier.
-
Gemini Stability: In stark contrast, the Gemini models (including Gemini 3 Pro) on the same Ultra account are functioning correctly. Their quota consumption aligns with expectations, allowing for extended coding sessions without premature exhaustion. This confirms the issue is isolated to the Claude integration on the Ultra plan.
Discrepancy with Value Proposition:
The current behavior contradicts the fundamental promise of the Ultra tier: high volume usage. Paying a premium for “Ultra” implies the ability to run multiple long-context sessions or complex agents. Having 60% of the quota vanish after one prompt suggests a severe bug in calculation logic or a misconfiguration specific to UTC+8 regions.
Request:
Could the team please provide an immediate explanation?
-
Is this a known bug causing massive over-charging for UTC+8 users?
-
Why does a single chat consume 60% of the quota when the expectation was to support dozens of such sessions?
-
Why does the Pro tier currently offer better effective capacity for Claude than the Ultra tier?
I chose the Ultra plan specifically for its promised scale. The current situation, where a single interaction drains the majority of my resources, is unacceptable. I hope for a swift investigation and a fix to restore the service levels I paid for.
Environment Details:
-
Subscription: Google AI Ultra
-
Region/Timezone: UTC+8
-
Affected Models: Claude (Sonnet/Opus)
-
Working Models: Gemini Series
-
Specific Symptom: ~60% quota consumed per single conversation start.
-
Onset Date: Earlier this week (approx. March 22–25, 2026)
Thank you for your urgent attention to this matter.
Best regards