Low rate limits for batch embeddings on Tier 3

Ralph_S · January 27, 2026, 8:10am

Hello,

I want to embed a very large dataset (>60 billion tokens) with gemini-embedding-001, using batch embeddings to save costs.

I’m on Tier 3, but the rate limit remains at 10,000,000 enqueued tokens. This seems low compared to other models (e.g. Gemini 3 Pro Preview is at 1,000,000,000). I have filled in the form to request higher rate limits but have received no response. How can I get higher rate limits?
Also, even though I have just under 10 million tokens enqueued, my usage does not show up under the rate limits in the Quota overview on Google Cloud. Is this a known bug?
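
To put the numbers in perspective, here is a rough sketch of the arithmetic, using only the two figures quoted above (60 billion tokens as a lower bound, 10,000,000 enqueued tokens as the limit). The helper names `queue_fills` and `pack_batches` are hypothetical and written just for this illustration; nothing here calls the Gemini API, and the per-document token counts would have to come from whatever tokenizer or counting step is in use.

```python
DATASET_TOKENS = 60_000_000_000  # lower bound from the post (>60 billion tokens)
ENQUEUED_LIMIT = 10_000_000      # enqueued-token limit quoted in the post


def queue_fills(total_tokens: int, limit: int) -> int:
    """Minimum number of times the queue must be filled and drained (ceiling division)."""
    return -(-total_tokens // limit)


def pack_batches(token_counts: list[int], limit: int) -> list[list[int]]:
    """Greedily group document indices into batches whose token sum stays within `limit`."""
    batches: list[list[int]] = []
    current: list[int] = []
    used = 0
    for index, count in enumerate(token_counts):
        if count > limit:
            raise ValueError(f"document {index} alone exceeds the limit")
        if used + count > limit:
            # Current batch is full: close it and start a new one.
            batches.append(current)
            current, used = [], 0
        current.append(index)
        used += count
    if current:
        batches.append(current)
    return batches


if __name__ == "__main__":
    print(queue_fills(DATASET_TOKENS, ENQUEUED_LIMIT))  # 6000
```

So at the current limit the queue would have to be filled and fully drained at least 6,000 times, one after the other, which is why the limit matters so much for a dataset of this size.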
