I am developing a product that uses a Gemini API key through the OpenAI SDK. To calculate cost, I read prompt_tokens, completion_tokens, and total_tokens from each API call, expecting prompt_tokens + completion_tokens = total_tokens. However, total_tokens is much larger than the sum of the other two. I have used two models, gemini-2.5-pro and gemini-2.5-flash, with structured output. Is there a mistake on my side, or does Google include thinking tokens in the output count?
Hi @hung_hoang_dinh ,
Refer to: Understand and count tokens | Gemini API | Google AI for Developers
Thanks!
It turns out there are also reasoning (thinking) tokens. When using the OpenAI-compatible format, completion_tokens doesn't include the reasoning tokens, even though they are counted as output for billing, so total_tokens ends up larger than prompt_tokens + completion_tokens. It's probably a minor compatibility quirk.
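For anyone hitting the same discrepancy, the thinking-token count can be estimated directly from the usage object. A minimal sketch (field names follow the OpenAI SDK's usage payload; the sample numbers are made up for illustration, not real billing data):

```python
def estimate_thinking_tokens(usage: dict) -> int:
    """Tokens billed as output but missing from completion_tokens,
    i.e. the model's hidden reasoning/thinking tokens."""
    return usage["total_tokens"] - usage["prompt_tokens"] - usage["completion_tokens"]

# Hypothetical usage payload from one chat.completions call:
usage = {"prompt_tokens": 120, "completion_tokens": 45, "total_tokens": 310}
print(estimate_thinking_tokens(usage))  # 145
```

For cost calculation, the output side would then be completion_tokens plus this estimate, which is simply total_tokens - prompt_tokens.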