Is there a way to get rate limits via API?

John_Horton · September 26, 2024, 12:21pm

With OpenAI API for example, the rate limit info is in the header:

return {
                        "rpm": int(headers["x-ratelimit-limit-requests"]),
                        "tpm": int(headers["x-ratelimit-limit-tokens"]),
                    }

Is there an equivalent, ideally through the google.generativeai python sdk?
Context: My account lists a much higher RPM than I can get in reality before hitting the resource exhausted exception.

Guillaume_Vernade · September 26, 2024, 3:20pm

Hello John,

I don’t think there’s a way at the moment but we should build one. There’s a feature request already in the backlog, I’ve nudged it.

Topic		Replies	Views
Where/how do I find remaining tokens/requests count after making a request? Gemini API docs , ai	1	317	October 1, 2024
Where/how do I find remaining tokens/requests count after making a request? Google AI Studio docs , ai	0	161	September 30, 2024
Gemini API Token metrics Gemini API api , feature-request , metrics	0	33	April 16, 2025
5 RPM - Will that be increased in future? Gemini API	4	235	May 2, 2024
Issue with Quota Limit in Gemini API Gemini API	1	338	May 4, 2024

Is there a way to get rate limits via API?

Related topics