Hi everyone,
I’ve been looking into the billing for Gemini 2.5 Flash, and the output token charges have me curious, so I have a few questions. Maybe I misread the documentation, which says:
Output price (including thinking tokens) - $2.50 per million tokens in USD
Questions
- When `thinkingBudget > 0`, are thinking tokens always added to `outputTokens` for billing purposes, or does the API not charge us for thinking tokens?
- If I set `thinkingBudget = 0` (i.e. disable thinking), do I still pay the same $2.50/M rate for the remaining output tokens?
- Is there any scenario where thinking tokens are free or billed at a different rate?
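To make the first question concrete, here is a minimal sketch of how I currently read the quoted pricing line. It assumes that thinking tokens are billed at the same $2.50/M rate as the visible output, which is exactly the part I'd like confirmed. The function and parameter names are my own and are not part of the API.

```python
# Output rate taken from the pricing line quoted above.
OUTPUT_PRICE_PER_MILLION_USD = 2.50


def output_cost_usd(visible_tokens: int, thinking_tokens: int = 0) -> float:
    """Estimate the output charge in USD.

    Assumption (unconfirmed): thinking tokens are added to the visible
    output tokens and billed at the same per-million rate.
    """
    billed_tokens = visible_tokens + thinking_tokens
    return billed_tokens * OUTPUT_PRICE_PER_MILLION_USD / 1_000_000


# Thinking enabled: 1,000 visible tokens plus 4,000 thinking tokens.
print(output_cost_usd(1_000, 4_000))
# Thinking disabled: only the 1,000 visible tokens.
print(output_cost_usd(1_000))
```

If that reading is right, a response with a lot of thinking could cost several times more than its visible length suggests.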
Pricing page for reference: Gemini Developer API Pricing | Gemini API | Google AI for Developers
Thanks in advance for clarifying!
— Allan