Billing SKU mismatch: gemini-2.5-flash-lite input charged correctly but output charged as "gemini 2.5 flash" (not lite)

user4485 · January 20, 2026, 3:02am

1. Rate Limits Page (Correct)

The Google AI Studio Rate Limits page shows I’m using gemini-2.5-flash-lite:

Model: gemini-2.5-flash-lite
Category: Text-out models
RPM: 1 / 4K
TPM: 34.08K / 4M

2. Billing Report (Mismatch)

SKU	Usage	Cost
`gemini 2.5 flash lite short input text`	92,797 tokens	$0.29
`gemini 2.5 flash short output text non-thinking`	123,298 tokens	$1.55

The output cost of $1.55 for 123,298 tokens equals ~$12.57/1M tokens, which is much higher than:

Expected Flash Lite output: $0.40/1M
Even Flash output: $2.50/1M

My Code

from google import genai

client = genai.Client(api_key=api_key)
model_name = "gemini-2.5-flash-lite"  # Also tried "models/gemini-2.5-flash-lite"

response = client.models.generate_content(
    model=model_name,
    contents=contents
)

Environment

SDK: google-genai (latest version)
Model: gemini-2.5-flash-lite
API: Google AI Studio (not Vertex AI)

Questions

Why is the output being billed under a different SKU (gemini 2.5 flash) than the input (gemini 2.5 flash lite)?
Is there any additional configuration needed to ensure output tokens are also billed at the Flash Lite rate?
Is this a known billing system issue?

Thank you for your help!

Topic		Replies	Views
Bug: Gemini 2.5 Flash Lite candidatesTokenCount underreports output tokens by ~8.5x vs actual billing Gemini API ai-studio , bug , gemini , gemini-flash-2-5	3	189	February 25, 2026
Billing discrepancy: detailed token usage and pricing info Gemini API gemini-flash , billing	7	700	July 17, 2025
Batch API Gemini 2.5 Flash charges are abnormally excessive in AI Studio Gemini API billing	5	175	March 17, 2026
Unexpected ‘Number of videos generated’ billing for Gemini API text requests Gemini API api , gemini	1	68	October 6, 2025
Why is the charge different from what I calculated? Gemini API api , gemini-flash	1	169	June 25, 2025

Billing SKU mismatch: gemini-2.5-flash-lite input charged correctly but output charged as "gemini 2.5 flash" (not lite)

1. Rate Limits Page (Correct)

2. Billing Report (Mismatch)

My Code

Environment

Questions

Related topics