I am trying out gemini-2.5-pro on the Gemini API instead of Google Vertex AI, because the latter was producing too many unpredictable resource-exhausted errors.
Usually my requests go through (apart from the occasional 503, so I guess this endpoint is also not entirely free of resource-exhaustion issues). However, one of the answers from the server was:
```
Gemini API error: {
  "error": {
    "code": 400,
    "message": "Unable to submit request because the input token count is 106244 but model only supports up to 65536. Reduce the input token count and try again. You can also use the CountTokens API to calculate prompt token count and billable characters. Learn more: https://cloud.google.com/vertex-ai/generative-ai/docs/learn/models",
    "status": "INVALID_ARGUMENT"
  }
}
```
But none of the models has such a small input token limit (https://ai.google.dev/gemini-api/docs/models); all the models listed there have an output limit of 65,536 tokens, not an input limit.
What could be wrong?
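For reference, this is roughly how I plan to check the prompt size via the CountTokens API that the error message suggests. This is a minimal sketch of my assumption of the REST shape (public Gemini API `:countTokens` endpoint, `contents` structured like a `generateContent` call); I haven't confirmed it is the exact request my client library sends:

```python
import json

# Sketch (assumed REST shape, not verified against my client library):
# build a countTokens request so the prompt's token count can be checked
# before sending the real generateContent request. Actually sending it
# would be something like:
#   requests.post(ENDPOINT, params={"key": API_KEY}, json=payload)
MODEL = "gemini-2.5-pro"
ENDPOINT = (
    "https://generativelanguage.googleapis.com/v1beta/models/"
    f"{MODEL}:countTokens"
)

# Same "contents" structure as a generateContent call.
payload = {"contents": [{"parts": [{"text": "your full prompt here"}]}]}
body = json.dumps(payload)
print(ENDPOINT)
print(body)
```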