2.5 Flash Lite Preview number inconsistency

reid · June 18, 2025, 12:30am

In the documentation, 2.5 Flash-lite is stated to have a 1M token window.
In the AI Studio, however, it’s ~65K.
The API method(client.aio.models.list()) states the same thing:

Model Name: models/gemini-2.5-flash-lite-preview-06-17
  Display Name: Gemini 2.5 Flash Lite Preview 06-17
  Labels: None
  Version: 2.5-preview-06-17
  Model Config: {'alias_generator': <function to_camel at 0x00000161D4742340>, 'populate_by_name': True, 'from_attributes': True, 'protected_namespaces': (), 'extra': 'forbid', 'arbitrary_types_allowed': True, 'ser_json_bytes': 'base64', 'val_json_bytes': 'base64', 'ignored_types': (<class 'typing.TypeVar'>,), 'validate_by_alias': True, 'validate_by_name': True}
  Description: Preview release (June 11th, 2025) of Gemini 2.5 Flash Lite
  Input Token Limit: 65536
  Output Token Limit: 65536
  Supported Actions: ['generateContent', 'countTokens', 'createCachedContent', 'batchGenerateContent']
  Checkpoints: None
  Endpoints: None

What’s the actual limit? Is that a bug?

GUNAND_MAYANGLAMBAM · June 18, 2025, 6:48am

Hey @reid , Thank you for bringing this to our attention. The input token limit is 1 million. I will follow up with the team to look into it.

Topic		Replies	Views
Output tokens limit for the finetuned gemini flash 1.5 Gemini API fine-tuning	12	2465	October 12, 2024
Gemini 2.5 Pro and Flash Limits in Google AI Studio UI Google AI Studio ai-studio , api , models	1	90	July 2, 2025
Token count decrease from 1 mil to 16k after fine tune Google AI Studio fine-tuning , models	2	201	October 15, 2024
Truncated responses despite being under limits Gemini API api , gemini-2-5	2	233	June 11, 2025
Gemini 2.5 API bug: missing finishReason when max token limit is reached Gemini API api , gemini-api	1	508	April 30, 2025

2.5 Flash Lite Preview number inconsistency

Related topics