For the past 3 hours, I have been trying to run my code, and I am always getting this error:
ServerError: 503 UNAVAILABLE. {‘error’: {‘code’: 503, ‘message’: ‘The model is overloaded. Please try again later.’, ‘status’: ‘UNAVAILABLE’}}
I am using gemini-2.0-flash. Anyone else having similar problems?
EDIT: It is working now after 16 hours. Google, please scale your models according to your consumer base.
Hi there! Any updates about this topic? Our new feature is ready to go to production except for this issue. It’s not handling tests, imagine in production, where approximately 800000 requests will be made per month.