Hitting quota limit suddenly, have payment methods and everything setup - maybe I'm stupid, I wouldn't know

I recently forked the AI Studio demo for the Gemini Live API and noticed some inconsistencies with the quota limits. According to the “Quotas and Limits” page, the free tier for the Gemini API is listed as 5 requests per minute (RPM). However, in my AI Studio dashboard, it shows a limit of 50 RPM, and my account is marked as Tier 1. This discrepancy is confusing, and I’d appreciate clarification on how these limits are applied (e.g., per API key, per project, or per account). Screenshots of my dashboard and the limits page are attached for reference.

While the Gemini Live API is impressive, the current rate limits feel restrictive for prototyping and development. For comparison, other LLM providers offer simpler API access with higher limits, making it easier to build and test applications quickly. The process of navigating tiers and quotas in Google Cloud feels like an unnecessary hurdle, detracting from an otherwise excellent product.

Could someone from the Google team clarify the following:

  1. Why does my AI Studio show 50 RPM and Tier 1 while the documentation indicates 5 RPM for the free tier? Is this a bug or an intentional difference?

  2. What are the steps to request a quota increase for the Gemini Live API to support more robust prototyping? I would like to be on Tier 3.

  3. Does Vercel AI provide access to the Gemini Live API with higher limits, or is it subject to the same restrictions as Google’s platform?

  4. When do these limits rest ? :frowning:

It would be incredibly helpful if Google could streamline the process for accessing and scaling API limits to make development more seamless. Any guidance on how to resolve these issues and get back to building would be greatly appreciated.

I have demo to show with Gemini Live API on monday but now im worried i cant improve the product in fier of getting iced out of actually showing the demo to my audience.

Thank you!