I have been using gemini google API with openAI format through this API
https://generativelanguage.googleapis.com/v1beta/openai/
But suddenly I’m getting this error
RateLimitError: Error code: 429 - [{'error': {'code': 429, 'message': 'Resource has been exhausted
(e.g. check quota).', 'status': 'RESOURCE_EXHAUSTED'}}]
Even though I’m sending only one query at a time with small question "What is the capital of Egypt)
If I used gemini through the google sdk it works just fine. Did I hit the OpenAI format limit?
1 Like
Yes, you did.
Wait a little bit or try again tomorrow.
Do you have any link or documentation on this, like what is the limit ?
We are experiencing the same issue.
We are on paid plan and we are not reaching anywhere close to the quota based on usage statistics.
When we request to /generate_content API, it does not return quota error. But when we request with stream=True, it returns different error.
I am also experiencing the same issue specifically when streaming the response. I am getting frequent 429 errors.
It looks like the action items are:
- Google to provide documentation on the OpenAI format limit.
- Google to investigate the 429 error issue with streaming responses.
1 Like
Hi google team, is there any update on this issue? We have also tried using the latest openai library version but the issue persists.
Using google-generativeai generate_content() with the same key works but the issue only occurs with OpenAI format (and it was working fine last week).
Could you please provide an update?
We have discovered that integrating Gemini is a necessary step.
As OpenAI users, we are delighted to utilize this compatible API. However, we are somewhat disappointed when we achieve satisfactory results in the playground, as we encounter 429 errors.