Claude agents (Sonnet/Opus 4.6 thinking) frequently hit “maximum token output exceed” error prematurely on long generations — recent regression, loops with retry

Sangram_Pattanayak · May 3, 2026, 6:23pm

I’m experiencing a consistent issue in Google Antigravity IDE with Claude-powered agents (especially Sonnet 4.6 and Opus 4.6 thinking modes).

Problem:

The agent starts thinking normally and generation proceeds slowly, then it suddenly fails with “The model’s generation exceeded the maximum output token limit” (or a similar “maximum token output exceed” message).

This happens even when the UI shows plenty of output tokens and context remaining, and even when generating long files or code that previously completed successfully in one pass.

The agent then retries automatically, often looping with the same error.

It used to handle truly long outputs reliably before recent updates.

