Do thinkingBudget tokens count toward billed output in Gemini 2.5 Flash?

Allan_K · July 11, 2025, 4:04pm

Hi everyone,

I’ve been looking into billing for Gemini 2.5 Flash, and I have a few questions about how output tokens are billed. Maybe I misread the documentation, which says this:

Output price (including thinking tokens) - $2.50 per million tokens in USD

Questions

When thinkingBudget > 0, are thinking tokens always added to outputTokens for billing purposes, or does the API not charge for thinking tokens?
If I set thinkingBudget = 0 (i.e. disable thinking), do I still pay the same $2.50 /M rate for the remaining output tokens?
Is there any scenario where thinking tokens are free or billed at a different rate?

Pricing page for reference: Gemini Developer API Pricing | Gemini API | Google AI for Developers

Thanks in advance for clarifying!

— Allan

Richard_Davey · July 11, 2025, 5:11pm

If the API sends you tokens, you pay for them at the model-specific ‘output’ rate. It doesn’t matter where they come from (thinking or response). There’s no separate ‘thinking token’ cost; it’s all just output cost, because they’re all just tokens.
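
A minimal sketch of the arithmetic this implies, assuming the $2.50 per million output rate quoted in the question and that thinking and response tokens are simply summed before pricing. The function and parameter names here are hypothetical, chosen for illustration; the token counts would come from whatever usage figures the API reports for a request.

```python
# Assumption: flat output rate quoted in the question for Gemini 2.5 Flash.
OUTPUT_PRICE_PER_MILLION_USD = 2.50


def billed_output_cost(
    response_tokens: int,
    thinking_tokens: int = 0,
    price_per_million: float = OUTPUT_PRICE_PER_MILLION_USD,
) -> float:
    """Estimate output cost in USD (hypothetical helper, not an official API).

    Thinking tokens and response tokens are treated as the same kind of
    billable output, so they are added together and priced at one rate.
    """
    if response_tokens < 0 or thinking_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total_output_tokens = response_tokens + thinking_tokens
    return total_output_tokens * price_per_million / 1_000_000


# Thinking enabled: 1,000 response tokens plus 3,000 thinking tokens.
with_thinking = billed_output_cost(response_tokens=1_000, thinking_tokens=3_000)

# thinkingBudget = 0: only the 1,000 response tokens are billed, at the same rate.
without_thinking = billed_output_cost(response_tokens=1_000)

print(f"with thinking:    ${with_thinking:.4f}")
print(f"without thinking: ${without_thinking:.4f}")
```

Under these assumptions, disabling thinking does not change the per-token rate; it only removes the thinking tokens from the total being priced.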
