Billing discrepancy: detailed token usage and pricing info

Cheshire_Cat · June 22, 2025, 10:36am

I’m using gemini-2.5-flash-preview-05-20 (as I started my current project before the 06-17 version and want consistent results) with the Gemini API. However, I’m having trouble understanding my billing.

According to Google AI Studio, the input/output pricing for non-thinking mode should be $0.15/$0.60 per 1M tokens.
Based on my estimates, I’ve processed approximately 48M input tokens and 30M output tokens, so I would expect to be charged around $26. However, my current billing is close to $60.
I’m pretty sure I’m missing something, but I can’t identify what it is.

Would it be possible to know (or confirm) the exact pricing for this model and also the actual usage (# tokens)?
Thank you!

Deepakishore · June 23, 2025, 8:37am

Hey @Cheshire_Cat — that does sound confusing. If your token estimates are accurate, the billing seems off. It’s best to check the detailed usage breakdown in your Google Cloud Console and confirm if any extra charges (like thinking mode or retries) are being applied. Reaching out to support might help clarify the pricing for your specific model version.

Thanks!

Cheshire_Cat · June 23, 2025, 9:59am

Hi @Deepakishore, thank you for your reply.
I tried to check the detailed pricing breakdown in the Google Cloud Console, and I found that (with still some minor discrepancies), I’m (probably) being charged according to the gemini-2.5-flash pricing, even though I’m using the preview version.

I suppose that the 05-20 version has been replaced by the stable release, however in Google AI Studio I now see only the preview-04-17 version, even though here it is still mentioned the preview-05-20 version.

In any case, I didn’t expect to be charged stable pricing while using a preview version.

Lalit_Kumar · June 27, 2025, 9:58am

Hello,

You should only be charged for the model which you are using, so just to verify are you using gemini-2.5-flash-preview-05-20 only or did you switch to any other version?

Cheshire_Cat · June 27, 2025, 9:03pm

Hi @Lalit_Kumar, thank you for your reply.
I confirm that I’ve always been using gemini-2.5-flash-preview-05-20. However, based on the detailed pricing, it seems that I’ve been charged for gemini-2.5-flash.

I’m not entirely sure about this, as the pricing breakdown in the Google Cloud Console doesn’t specify the exact model version, in the SKU column it only says “Generate content output token count gemini 2.5 flash short output text non-thinking”.
By dividing the charged amount (̀~$62) by the token count (~28.5M), the result (~$2.2) is closer to the pricing for gemini-2.5-flash (i.e., $2.5) than to the current preview version (i.e., $0.6).

Lalit_Kumar · July 2, 2025, 6:29am

Hello!

To help me understand, could you please clarify if the token count you provided includes both the input and output tokens, or just the output tokens?

Cheshire_Cat · July 3, 2025, 10:57am

Hi, here’s the detailed pricing breakdown:

SKU	Usage	Cost
Generate content output token count gemini 2.5 flash short output text non-thinking	28.511.784 count	~$62
Generate content input token count gemini 2.5 flash short input text	57.630.552 count	~$16

If I calculate the price per M tokens, I get ~$2.2 for output and ~$0.3 for input, which seems more aligned with the pricing for gemini-2.5-flash (i.e., $2.5 for output and $0.3 for input) than with the current preview version (i.e., $0.6 for output and $0.15 for input), even though I’ve always been using gemini-2.5-flash-preview-05-20 in my experiments.

Krish_Varnakavi1 · July 17, 2025, 10:51pm

Hi @Cheshire_Cat,

Thanks for reporting this issue..

For all billing related issues or discrepancies, we have an official channel to report such issues using this link

While my colleague @Lalit_Kumar is kind enough to investigate this, he has to go through the same channel.. So opening a direct communication with billing can help speed-track the response.

Thanks for your cooperation!

Topic		Replies	Views
Billing Discrepancy for Gemini 2.5 Pro Usage Gemini API billing , gemini-2-5	2	292	August 8, 2025
Billing SKU mismatch: gemini-2.5-flash-lite input charged correctly but output charged as "gemini 2.5 flash" (not lite) Gemini API gemini-api	0	26	January 20, 2026
Gemini 2.5 Model Bug Causing Massive Bills, Google Support Unresponsive to Core Issue Gemini API billing , gemini-flash-2-5	10	1237	January 29, 2026
Why is the charge different from what I calculated? Gemini API api , gemini-flash	1	141	June 25, 2025
Thinking Tokens Counted, but Billed as Non-Thinking Gemini API api , billing	1	347	April 24, 2025

Billing discrepancy: detailed token usage and pricing info

Related topics