Opus 4.6 output just got capped to 64k since last night, is 4.7 about to drop in the IDE?

Since last night Claude Opus 4.6 in Antigravity keeps hitting “The model’s generation exceeded the maximum output token limit” on tasks that were completing fine before.

What actually happens is worse than a simple cutoff. The model generates its full response, hits the limit, sees the error, and then tries a different approach to fit within the cap. That fails too. So it retries again, and again, until it eventually has to strip down and compromise the entire response just to squeeze under the limit. The output you end up with is a degraded version of what the model originally intended. Opus 4.6 natively supports 128k output tokens but Antigravity seems to be capping it at around 64k now.

The timing is what got me curious. Opus 4.7 launched on April 16 and every other major coding tool already has it. Antigravity still shows 4.6 in the model picker. But 4.6 getting capped right after 4.7 goes live everywhere else feels like backend prep work, config changes before swapping to the new model.

Calling it: 4.7 is about to drop in the IDE. Anyone else seeing the same cap since last night?

Yes! This exact same thing has happened to me multiple times since last night. It is incredibly frustrating watching it get stuck in that retry loop trying to compress everything down.

And on top of the output getting completely degraded, the generation speed has gotten SOOOOO SLOOOOW while it fights with the limits. I really hope your theory about them prepping the backend for Opus 4.7 is right, because right now 4.6 is barely usable if it keeps throttling like this.