Even the Gemini Flash model is always showing an "exhausted" error

{“error”: {“code”: 503,“details”: [{“@type”: “type.googleapis.com/google.rpc.ErrorInfo”,“domain”: “cloudcode-pa.googleapis.com”,“metadata”: {“model”: “gemini-3-flash”},“reason”: “MODEL_CAPACITY_EXHAUSTED”}],“message”: “No capacity available for model gemini-3-flash on the server”,“status”: “UNAVAILABLE”}}

Hi @VoVoZera

Thank you for sharing your feedback on the server issue.
I have shared the issue with our engineering team to investigate further, and we appreciate your patience as we work toward a resolution.

Same here. I have tokens available, I bought extra capacity, and still having problem to work will al models. I will shift to Codex or any other reliable LLM.