Token counting in Google AI Studio Playground vs API

I’ve noticed a potential discrepancy in how input tokens are counted across multiple rounds of chat. In Google AI Studio’s Playground, the displayed cost estimate seems to count only the user’s prompts as input tokens. However, when using the API, my understanding is that the entire previous conversation history, including both user and model tokens, is counted as input on every round of chat. Am I missing something?
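To illustrate the API-side behavior described above, here is a minimal sketch of how input tokens accumulate when each request resends the full history. The per-message token counts are made-up numbers for illustration; real counts would come from the API’s token-counting endpoint.

```python
# Hypothetical per-message token counts for a 3-turn chat
# (illustrative values, not real API output).
turns = [
    {"user": 20, "model": 150},
    {"user": 15, "model": 200},
    {"user": 10, "model": 180},
]

# Each API request resends the full history, so the input tokens for
# turn n include every earlier user AND model message, plus the new prompt.
input_tokens_per_turn = []
history = 0
for t in turns:
    input_tokens_per_turn.append(history + t["user"])
    history += t["user"] + t["model"]

print(input_tokens_per_turn)  # [20, 185, 395]
```

Note how the input grows each turn even though the new prompts are short, which is exactly why a UI estimate that counts only user prompts would diverge from actual API billing.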

Hey Steven, agreed on this. I proposed the following to the team:

  1. A single-request cost estimate: what it would cost to recreate, in a single turn, exactly what is shown in the chat UI.
  2. A multi-turn request estimate: if this was an iterative conversation, here’s how much it would have cost.

Will get this actioned so that it is more clear!

Hey Logan, thanks for the reply - that sounds good, and I look forward to it!
A follow-up question: I notice that when using the API there are many encrypted thought tokens, which are counted as part of the output tokens. How are those tokens handled in AI Studio’s cost estimator?