Thinking ate all the tokens and hit MAX_TOKENS


Is this a bug, or can this happen? The thinking phase consumed the entire output budget and the request finished with `MAX_TOKENS`, leaving no content parts:

```json
{
  "candidates": [
    {
      "content": {
        "parts": [],
        "role": "model"
      },
      "finishReason": "MAX_TOKENS",
      "safetyRatings": [
        {
          "category": "HARM_CATEGORY_SEXUALLY_EXPLICIT",
          "probability": "NEGLIGIBLE",
          "blocked": null
        },
        {
          "category": "HARM_CATEGORY_HATE_SPEECH",
          "probability": "NEGLIGIBLE",
          "blocked": null
        },
        {
          "category": "HARM_CATEGORY_HARASSMENT",
          "probability": "NEGLIGIBLE",
          "blocked": null
        },
        {
          "category": "HARM_CATEGORY_DANGEROUS_CONTENT",
          "probability": "NEGLIGIBLE",
          "blocked": null
        }
      ],
      "citationMetadata": {
        "citationSources": []
      },
      "tokenCount": null,
      "index": 0,
      "avgLogprobs": null,
      "groundingAttributions": [],
      "groundingMetadata": null,
      "logprobsResult": null,
      "urlRetrievalMetadata": null
    }
  ],
  "promptFeedback": null,
  "usageMetadata": {
    "promptTokenCount": 17759,
    "candidatesTokenCount": null,
    "totalTokenCount": 83294,
    "cachedContentTokenCount": null,
    "toolUsePromptTokenCount": null,
    "thoughtsTokenCount": 65535,
    "promptTokensDetails": [
      {
        "tokenCount": 17759,
        "modality": "TEXT"
      }
    ],
    "cacheTokensDetails": [],
    "candidatesTokensDetails": [],
    "toolUsePromptTokensDetails": []
  },
  "modelVersion": "gemini-2.5-pro"
}
```

Hi @NeonByteNomad, thanks for reaching out to us.

For Gemini 2.5 Pro, you can explicitly set a lower `thinkingBudget` (for example, `thinking_budget=1024`) in your request configuration to prevent the model's internal reasoning from consuming all of the output tokens.
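A minimal sketch with the `google-genai` Python SDK (the prompt is a placeholder, and a valid API key is assumed to be set in the `GEMINI_API_KEY` environment variable):

```python
from google import genai
from google.genai import types

client = genai.Client()  # reads GEMINI_API_KEY from the environment

response = client.models.generate_content(
    model="gemini-2.5-pro",
    contents="Summarize the attached document.",  # placeholder prompt
    config=types.GenerateContentConfig(
        # Cap internal reasoning at 1024 tokens so the thinking phase
        # cannot consume the entire output budget before any answer
        # text is produced.
        thinking_config=types.ThinkingConfig(thinking_budget=1024),
    ),
)
print(response.text)
```

With the budget capped, `usageMetadata.thoughtsTokenCount` should stay at or below roughly 1024, leaving the rest of the output limit for the actual answer.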

Additionally, you can try Gemini 3 Pro, which has a `thinking_level` parameter to control the maximum depth of the model's internal reasoning process. You can set it to `low` to minimize token usage.