Hi everyone,
I am building a RAG pipeline and using models/gemini-embedding-001 to process a relatively small dataset (approx. 65 markdown files).
The Issue: I am encountering persistent 429 Resource Exhausted errors that seem unrelated to my actual usage volume.
-
I successfully processed a small batch of files, but suddenly hit a wall.
-
Currently, I get an immediate
429error on the very first request, even after waiting for hours. -
The
retry-afterheader returns spiraling wait times (e.g., 320s+), and the request never succeeds.
Troubleshooting Steps Taken:
-
I implemented exponential backoff in my code, but it doesn’t help since the initial request is blocked immediately.
-
I tried creating a fresh API Key on a new project to rule out quota issues, but I still receive the same 429 error instantly.
-
I noticed other users reporting similar “phantom” 429 errors recently (e.g., Link 1, Link 2).
My Questions:
-
Is there a known issue or degradation with the
gemini-embedding-001model specifically? -
I am planning to upgrade to the Pay-As-You-Go (Blaze) plan to resolve this. However, seeing other threads where Paid Tier users are also facing 429 errors, I am hesitant. Will upgrading guarantee a fix for this specific error?
Any advice is appreciated.
Thanks.