It’s been 10 days, i am getting this every time in every way. i also use API in local python and have the same issue. I can only use Flash 2.5 which i don’t need.
I have all limits.
What is the source code to see the API call ? Have you tried asking Gemini to Debug it for you ? Also just a thought but what model do you intend to use if Flash 2.5 is not your preference ? Are you running it locally ?
You will have to provide more than just the errors i.e. information on the environment you’re working with. (For example: Locally using Gemini, or are you in Ai Studio / Firebase? Are you pulling from a Repo of sorts etc.. How are you linking the Model ? Did you Generate credentials and allocate appropriate resource tiers via Ai Studio or Vertex AI or directly via Google Cloud resource manager etc..)
I ask this because my guess is you’re using a free tier of sorts somewhere. You might need to upgrade or allocate more compute / demand depending on how you’ve got it setup and where the app actually draws the model from ? Or are you using a 3rd party of sorts… ?
This result from inside Google Ai studio. I have paid tier and when i tested in local python, i used API from here.


