How can I know how much tokens are generated from Gemini model from OpenAI SDK

Suparva · April 30, 2025, 8:31pm

How to get the “OUTPUT” tokens count from an streamed response of Gemini from OpenAI SDK.

Siva_Sravana_Kumar_N · May 1, 2025, 5:43pm

You can check the response object to get the number of tokens used. The output looks like this

ChatCompletion(id=None, choices=[Choice(finish_reason='stop', index=0, logprobs=None, message=ChatCompletionMessage(content='I am doing well, thank you for asking! How are you today?\n', refusal=None, role='assistant', annotations=None, audio=None, function_call=None, tool_calls=None))], created=1742584713, model='gemini-2.0-pro-exp-02-05', object='chat.completion', service_tier=None, system_fingerprint=None, usage=CompletionUsage(completion_tokens=15, prompt_tokens=14, total_tokens=29, completion_tokens_details=None, prompt_tokens_details=None))

But in case before supplying to the Gemini if you want to know the number of tokens you can use count_tokens api and count tokens before sending into the model.

Suparva · June 1, 2025, 9:43am

Counting tokens before sending to Gemini isn’t this increasing the load and delaying my users experience. Even a millisecond matters.

Topic		Replies	Views
Is there a calculation method or library of tokens used by Gemini? Google AI Studio	6	281	May 10, 2024
Input/output tokens telemetry/usage metrics Gemini API gemini-15 , api , models	1	57	February 10, 2025
Understanding Token Counts Gemini API models , prompt	2	103	February 26, 2025
Where/how do I find remaining tokens/requests count after making a request? Google AI Studio docs , ai	0	170	September 30, 2024
Có gì đó không đúng khi tính toán tokens_total khi call Gemini model thông qua OpenAI SDK Gemini API ai-studio , api	2	27	June 25, 2025

How can I know how much tokens are generated from Gemini model from OpenAI SDK

Related topics