We are experiencing persistent 429 Too Many Requests errors while using the Gemini API with the gemini-3.1-pro-preview model.
This is happening on a Tier 3 account and is significantly impacting production reliability.
Environment
- API: Gemini API
- Platform: Google AI Studio
- Model: gemini-3.1-pro-preview
- Usage Tier: Tier 3
- Authentication: API Key
- SDK/API Method: REST
Issue Description
We are receiving frequent HTTP 429 Too Many Requests responses, even though our usage appears to remain within expected Tier 3 limits.
Typical error:
{
  "error": {
    "code": 429,
    "message": "Resource has been exhausted",
    "status": "RESOURCE_EXHAUSTED"
  }
}
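For reference, a minimal Python sketch of how a client might recognize this specific response and separate it from other failures. The function name `is_resource_exhausted` is illustrative and not part of any SDK; the only assumption is that the error body has the shape shown above.

```python
import json


def is_resource_exhausted(http_status, body_text):
    """Return True if the response is a 429 whose body reports RESOURCE_EXHAUSTED."""
    if http_status != 429:
        return False
    try:
        error = json.loads(body_text).get("error", {})
    except (ValueError, AttributeError, TypeError):
        # Body was missing, not valid JSON, or not a JSON object.
        return False
    return isinstance(error, dict) and error.get("status") == "RESOURCE_EXHAUSTED"
```

Checking the `status` field as well as the HTTP code makes it easier to log quota-style 429s separately from any other 429 an intermediary might return.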
Observed Behavior
- 429s occur repeatedly during normal production traffic
- Exponential backoff and retries do not consistently recover
- API reliability has become unpredictable despite Tier 3 access
Mitigations Already Attempted
- Implemented exponential backoff with jitter
- Reduced concurrency
- Lowered request throughput
- Added retry queues
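To make the first mitigation concrete, here is a minimal sketch of exponential backoff with full jitter. It is a simplified illustration, not our exact production code: `send_request` is a hypothetical callable that returns `(status_code, body)`, and the `base`, `cap`, and `max_retries` values are placeholder assumptions.

```python
import random
import time


def backoff_delays(base=1.0, cap=60.0, max_retries=5, rng=random.random):
    """Yield one sleep duration per retry (exponential backoff, full jitter)."""
    for attempt in range(max_retries):
        # The upper bound doubles on each attempt and is capped at `cap` seconds.
        upper = min(cap, base * (2 ** attempt))
        # Full jitter: scale the bound by a random factor in [0, 1).
        yield rng() * upper


def call_with_backoff(send_request, max_retries=5, sleep=time.sleep, **delay_kwargs):
    """Call send_request() until it returns a non-429 status or retries run out.

    send_request is a hypothetical callable returning (status_code, body).
    """
    status, body = send_request()
    for delay in backoff_delays(max_retries=max_retries, **delay_kwargs):
        if status != 429:
            break
        sleep(delay)
        status, body = send_request()
    return status, body
```

With this shape, a request is attempted at most `1 + max_retries` times, and the last status is returned to the caller even if it is still 429, so a retry queue can decide what to do next.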
Questions
- Are there currently any known capacity or throttling issues affecting gemini-3.1-pro-preview?
- Are preview models subject to additional dynamic throttling beyond documented Tier 3 quotas?
- Is there any hidden concurrency/session limit that could trigger 429s?
We would appreciate any clarification or guidance, as this issue is affecting production stability despite operating on a Tier 3 account.