Hitting quota limit suddenly, have payment methods and everything setup - maybe I'm stupid, I wouldn't know

adam1e02818 · June 25, 2025, 9:35pm

Hey everyone, if I’m posting in the wrong place, forgive me, I’ve been searching for a solution all day.

So I’ve been using, even the basic pre-built Conversational web app in Google AI Studio and suddenly I’m hitting a quota limit. Payment accounts are sorted, billing account is good, and we have credits sorted.

I’m really stumped - using gemini-2.5-flash-preview-native-audio-dialog - contacted Google Billing and they sent me here… Really hope someone can help!

Best regards,
Adam

Krish_Varnakavi1 · June 25, 2025, 10:06pm

Hi @adam1e02818,

Welcome to the Google AI Forum!

Please go to GCP console and click “APIs & Services”. Under Metric, search and select “Generative Language API”.. Under “Quotas & System Limits” tab, check for “Current Usage percentage”..

If it reaches 100%, then you have reached your quota limits and hence might get 429 Error.

If you think that there is any discrepancy, please DM me with a clear error message and Project ID to help us investigate further.

adam1e02818 · June 25, 2025, 10:26pm

Hey Krish, so I looked an we’re not exceeding any quotas. There are no tools exceeding quotas, but we have a lot of items like these with 0 marked. I haven’t changed anything in here for the issue to start though, I’m not sure if something might have changed without my knowing.

Please see pic attached

Krish_Varnakavi1 · June 25, 2025, 10:52pm

Hmm.. I don’t see gemini-2.5-flash-preview-native-audio-dialog model listed in the attached screenshot and you have no usage in this account.. Can you reconfirm if you are checking stats from the same account that was used to test native-audio model?

KichangKim · June 26, 2025, 2:17am

Similar issue here (but solved now).

Few days ago, gemini-2.5-flash return quota limit error and google cloud dash board shows 99% usage of token per min limit. However, it was impossible to restrict the API because I had not used the Gemini Flash API for a few days and it is first call of that day (of course my request does not have 1M tokens). After some time, the error disappeared naturally, but I think this is clearly an API server error.

It seems that gemini-2.5 family is still extremely unstable …

Krish_Varnakavi1 · June 26, 2025, 4:20am

hi @KichangKim,

Thanks for sharing your experience..

We are rapidly evolving our models and had 2 releases within last 10 days.. During release, we’re aware of the server limit issues due to increased load during releases.. May be your usage fell during such time..

However if you face token limit issue again and you believe there is a discrepancy, feel free to DM me with your Project ID to help us troubleshoot.

adam1e02818 · June 27, 2025, 2:11am

Hey Krish, sure so. It’s the same account in Google AI Studio as Google Cloud console, I can’t see any differences. I’ve requested a quota uplift so I’ll speak to Sales people to see how they can help.

Thanks for looking into this, by the sounds of other comments it’s a recurring issue. Hopefully can find a way to fix.

Cheers,
Adam

Krish_Varnakavi1 · June 27, 2025, 10:35pm

Sounds good.. I hope this gets resolved for you soon..

Tony_Carnevale · June 28, 2025, 7:19pm

Any resolve on this? I have the EXACT same issue as Original Poster using that pre built one, you go to add your API and it lets you barley use anything. Nothing is out of limits per the console either.

Krish_Varnakavi1 · June 30, 2025, 9:08pm

@Tony_Carnevale,

Welcome to the Google AI Forum!

I will escalate your issue with the concerned team.. Can you DM me your Project ID..

Balaji_Chellappa · July 2, 2025, 5:50am

Hi Krish,
I have slightly different issue. I have billing enabled, using it for more then 3 months and exceeded $300 spent. But still struck with Tier 1.
Through IAM quota submitted request to increase limit 3 times and it got rejected.
Any help to promote our account to Tier 2.
1000 RPD for the gemini-2.5-pro model is too low for our use case.
Thanks
Balaji

adam1e02818 · July 2, 2025, 1:00pm

So from talking with my sales guy at Google, he basically recommended migrating to Vertex AI from Google AI Studio as the quota limits are just too low regardless of tier - this is because it’s a free service at point of access (like a teaser before you jump in and pay to play).

So I’m now enjoying the Google Cloud puzzle game where I’m breaking things repeatedly and wrapping my head around documentation. The shame for me is that we’re using the Voice AI for a bidirectional conversation, which there seems to be little information on setting up and such.

I hope there can be relevant documentation simply laid out to solve these increasingly used-by-noobs services soon.

Cheers & hope you’re all finding some useful info out.

Krish_Varnakavi1 · July 8, 2025, 12:35am

Hi @adam1e02818,

You are right.. Vertex AI is designed to build production quality services and have much more flexibility w.r.t customizing models as well.. I’m glad you got this info and heading in the right path.

Can you be specific on use-cases that you are trying to implement and areas where you feel documentation needs to be improved?

saqlain_ahmed · August 5, 2025, 11:19am

can you explain , why we are getting this error despite having tier 1 , we just hit for i think 5-6 only

laniko · August 10, 2025, 6:05am

I recently forked the AI Studio demo for the Gemini Live API and noticed some inconsistencies with the quota limits. According to the “Quotas and Limits” page, the free tier for the Gemini API is listed as 5 requests per minute (RPM). However, in my AI Studio dashboard, it shows a limit of 50 RPM, and my account is marked as Tier 1. This discrepancy is confusing, and I’d appreciate clarification on how these limits are applied (e.g., per API key, per project, or per account). Screenshots of my dashboard and the limits page are attached for reference.

While the Gemini Live API is impressive, the current rate limits feel restrictive for prototyping and development. For comparison, other LLM providers offer simpler API access with higher limits, making it easier to build and test applications quickly. The process of navigating tiers and quotas in Google Cloud feels like an unnecessary hurdle, detracting from an otherwise excellent product.

Could someone from the Google team clarify the following:

Why does my AI Studio show 50 RPM and Tier 1 while the documentation indicates 5 RPM for the free tier? Is this a bug or an intentional difference?
What are the steps to request a quota increase for the Gemini Live API to support more robust prototyping? I would like to be on Tier 3.
Does Vercel AI provide access to the Gemini Live API with higher limits, or is it subject to the same restrictions as Google’s platform?
When do these limits rest ?

It would be incredibly helpful if Google could streamline the process for accessing and scaling API limits to make development more seamless. Any guidance on how to resolve these issues and get back to building would be greatly appreciated.

I have demo to show with Gemini Live API on monday but now im worried i cant improve the product in fier of getting iced out of actually showing the demo to my audience.

Thank you!

Krish_Varnakavi1 · August 21, 2025, 10:53pm

Hi @laniko,

Welcome to the Google AI Forum!

I will try to answer all questions.. If you are looking for more context, feel free to post a follow-up.

Free tier limit: 5 RPM for Gemini 2.5-pro.. Everyone starts with this tier.
Tier 1: When you link a billing account to your Google Cloud project, you are automatically moved to “Tier 1”. Rate for 2.5-pro is 150 RPM (not 50).. If it’s not a typo, do let me know.

Here is the doc for rate limits.. Tier upgrades does happen automatically… If you want custom tiers or limits more than Tier-3, you can request it using “Request paid tier rate limit increase” button at the bottom of this page.

While the Vercel AI SDK is an excellent tool, Vercel itself does not provide a separate, higher quota for the Gemini Live API. Your usage of the Gemini API through Vercel will still be subject to the rate limits set on your Google Cloud project.

Your daily quotas for the Gemini API reset at midnight Pacific Time (PT).
Rate limits are applied at the project level, not per API key.

I understand your concern about the upcoming demo. The 150 RPM limit of Tier 1 is significantly more accommodating for prototyping than the 5 RPM of the free tier. As you prepare, I would recommend monitoring your usage within the Google Cloud console to ensure you stay within your limits.

Sourabh · September 14, 2025, 7:00am

hey man, facing the same issue, my project id is - gen-lang-client-0689013278
i’ve attached my billing account, i’m on tier 1, in the metrics it’s not showing that i’ve reached the limit but in my project it’s showing "Failed to generate scene. The content may have been blocked or an API error occurred. Details: {“error”:{“code”:429,“message”:“You exceeded your current quota, please check your plan and billing details. For more information on this error, head to: https://ai.google.dev/gemini-api/docs/rate-limits.",“status”:"RESOURCE_EXHAUSTED”

please help me out with it.

Sunpreet_cape · September 30, 2025, 2:58am

Just a little nugget for everyone using AiStudio and hitting rate limits inside aistudio but not on deployed apps.
There is a key icon in the top right of the IDE with a cross through it. Choose that and select your projects key in order to use the api with the higher RPM.

Seth_Ford · December 30, 2025, 4:00am

Same issue here I am trying to do use the 2.0 exp model because I can’t seem to get the performance out of 3.0 and I am trying to to realtime streaming. But I am getting rated limited even though I have a billing account setup and can’t find where the limits are kicking me.

HaziqFaris · March 2, 2026, 2:18am

I think the issues are still being persist until today

Topic		Replies	Views
CRITICAL BUG: Paid Project (Tier 1) but stuck on Free Tier Token Limit Gemini API api , google-cloud , billing	86	4174	May 11, 2026
Tier 1 billing enabled but stuck on free quotas Gemini API ai-studio , api , gemini , billing	2	1145	February 10, 2026
You exceeded your current quota, please check your plan and billing details Gemini API api , billing	10	6893	March 6, 2026
429 error with quota with tier Gemini API ai-studio , api , gemini	45	2236	March 12, 2026
Quota exceeded error on Tier 1 paid project despite minimal usage Gemini API ai-studio , billing , rate-limits	9	695	May 27, 2026

Hitting quota limit suddenly, have payment methods and everything setup - maybe I'm stupid, I wouldn't know

Related topics