I am getting this error, and I’m not sure what it means. It’s a 503, suggesting it is a server-side error. I’m using the free tier. I’m within the rate limits of 15 calls/minute and 1 million tokens / minute. I’m using the JavaScript/TypeScript SDK.
As the 5xx status code indicates, this is an error on Google’s side. It usually means that something went wrong internally, sometimes in a parallel service that Gemini calls rather than in Gemini itself.
I’m seeing more and more reports of something like this. Can you provide more details or a concrete example of the code that is triggering it? Are you using any tools? Large prompts? Media?
In addition to a user text prompt, I’m using cached context with a large PDF and system instructions. I’m trying to accurately extract information from research studies.
Try using Gemini’s experimental model “gemini-exp-1114”. I believe that Google allocated too many resources to the experimental model, possibly due to an imminent update.
Everything was overloaded yesterday. I checked again a minute ago and everything seems to work fine now.
I hope the issue is resolved for you all too?
I am also getting this error. I am using LangChain to traverse a PDF and answer questions with the help of Gemini, but sometimes when I ask a question it gives me this error.
I just switched from flash-001 to 002 and now these errors are showing up. Retry policies let the calls pass eventually in my case, but the errors are pretty frequent.
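For anyone who does not have a retry policy in place yet, here is a minimal sketch of one in TypeScript: exponential backoff with jitter around any async call. `withRetry`, `isRetryable`, and `RetryOptions` are hypothetical names, not part of any SDK. The sketch assumes the thrown error carries a numeric `status` property or mentions 503 in its message, and treating 429 and 503 as the retryable statuses is also an assumption, so adjust both to whatever your client actually throws.

```typescript
// Hypothetical helper, not part of any SDK: retries a call on transient errors.
interface RetryOptions {
  maxAttempts?: number; // total attempts, including the first one
  baseDelayMs?: number; // upper bound for the delay before the first retry
  maxDelayMs?: number;  // upper bound for any single delay
}

// Assumption: the error exposes an HTTP status as a numeric `status` property,
// or mentions "503" in its message. Adjust this to your client's error shape.
function isRetryable(err: unknown): boolean {
  const status = (err as { status?: unknown } | null)?.status;
  if (typeof status === "number") {
    return status === 429 || status === 503;
  }
  return err instanceof Error && /\b503\b/.test(err.message);
}

const sleep = (ms: number) =>
  new Promise<void>((resolve) => setTimeout(resolve, ms));

async function withRetry<T>(
  fn: () => Promise<T>,
  opts: RetryOptions = {}
): Promise<T> {
  const { maxAttempts = 5, baseDelayMs = 1000, maxDelayMs = 30000 } = opts;
  for (let attempt = 1; ; attempt++) {
    try {
      return await fn();
    } catch (err) {
      // Give up on non-transient errors or once the attempts are used up.
      if (attempt >= maxAttempts || !isRetryable(err)) throw err;
      // The cap doubles on each retry; the actual wait is a random
      // fraction of it ("full jitter") so clients do not retry in lockstep.
      const cap = Math.min(maxDelayMs, baseDelayMs * Math.pow(2, attempt - 1));
      await sleep(Math.random() * cap);
    }
  }
}
```

You would wrap the failing call in it, for example `await withRetry(() => model.generateContent(prompt))`, where `model.generateContent(prompt)` stands in for whichever request is returning the 503. Retries only paper over short overload windows; if the errors persist for hours, backing off longer or switching models is probably the better option.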