Severe Latency Inversion on Paid Tier 2 (Priority) using Gemini 3 Flash (Preview) compared to Standard Tier

suisuisuuuui · June 9, 2026, 9:16am

Hello,

We recently upgraded to Paid Tier 2 (AI Studio) and configured service_tier="priority". We verified that the response header returns x-gemini-service-tier: priority correctly.

However, we are experiencing a severe performance inversion when sending multimodal requests (including images resized to 1024px long-edge):

Standard Tier (Free): Completed in approx. 30 seconds.
Priority Tier (Paid): Takes 80 to 356 seconds (95th percentile latency hits 202,072 ms in Google Cloud Console), accompanied by frequent 503 Service Unavailable errors.

Cloud Support (Case 72095741) has already reviewed our case and confirmed that this 80-356s delay is “significantly outside the performance targets for the Gemini 3 Flash model on the Priority tier.”

Since we have already optimized everything on the application side (minimizing token/image size, adjusting concurrency), this seems to be a backend routing or quota synchronization bug unique to Paid Tier 2 after the upgrade.

Is anyone else experiencing this issue with the preview models on Priority? Any insights from the Google engineering team would be highly appreciated.

Topic		Replies	Views
503 - with gemini Priority inference Gemini API bug , api , gemini	2	46	June 1, 2026
Paid Tier 2 project: consistent 503 on gemini-3-pro-image-preview (Nano Banana Pro) while Tier 3 in same billing succeeds 100% — reproducible diagnostic Gemini API gemini-3	1	69	May 13, 2026
Noticeable 503 errors and latency difference between older 5months vs newer 2months paid API keys Gemini API api , gemini	0	8	June 11, 2026
Persistent 503 Server Overloaded errors on gemini-3.1-flash-image-preview – Tier 1 Paid Account Gemini API gemini-3	0	147	March 20, 2026
Sudden drastic degradation in latenecy and error rates with Gemini 2.0 Flash Gemini API api , gemini-api , gemini-20	1	204	February 28, 2025

Severe Latency Inversion on Paid Tier 2 (Priority) using Gemini 3 Flash (Preview) compared to Standard Tier

Related topics