Understanding the limit of fine-tuned Gemini Flash

According to the Gemini fine-tuning docs, “The input limit of a tuned Gemini 1.5 Flash model is 40,000 characters”. I wonder what “input limit” means here. Is it the total context window (including all previous questions and Gemini’s previous answers), or is only the latest query input subject to this limit (with the context window still at 1 million tokens)? Thanks

Clarification of Input Limit for Fine-tuned Gemini 1.5 Flash Models:

The 40,000-character input limit for fine-tuned Gemini 1.5 Flash models applies to the data sent within a single interaction (one request or one training example), not to the conversation history as a whole. This means:

  • Fine-tuning Datasets: Each training example (input prompt and desired response) must be at most 40,000 characters. Refer to the fine-tune tutorial.
  • Using the Tuned Model: Each request (input prompt) you send to the fine-tuned model must also be at most 40,000 characters (a length check is sketched below).
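
A minimal sketch of enforcing that limit on the client side, assuming the google-generativeai Python SDK and a hypothetical tuned model name (`tunedModels/my-tuned-flash`); the per-example check on the training data follows the reading above that prompt and response together count toward the 40,000 characters:

```python
import google.generativeai as genai

# Per-request / per-example character limit for tuned Gemini 1.5 Flash.
MAX_TUNED_INPUT_CHARS = 40_000

genai.configure(api_key="YOUR_API_KEY")  # placeholder key
model = genai.GenerativeModel("tunedModels/my-tuned-flash")  # hypothetical tuned model name


def generate_with_limit_check(prompt: str):
    # Enforce the tuned-model input limit on each individual request
    # before sending it, so the API doesn't reject it.
    if len(prompt) > MAX_TUNED_INPUT_CHARS:
        raise ValueError(
            f"Prompt is {len(prompt)} characters; a tuned Gemini 1.5 Flash model "
            f"accepts at most {MAX_TUNED_INPUT_CHARS} characters per request."
        )
    return model.generate_content(prompt)


# The same ceiling applies to each training example in a tuning dataset
# (assumption: prompt + desired response counted together, per the note above).
training_examples = [
    {"text_input": "What is the capital of France?", "output": "Paris"},
]
for ex in training_examples:
    assert len(ex["text_input"]) + len(ex["output"]) <= MAX_TUNED_INPUT_CHARS
```

This keeps oversized inputs from ever reaching the API; the check is purely local and does not affect the model's 1 million token context window discussed below.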

Important Note: This limit is separate from the overall context window of the base Gemini 1.5 Flash model, which remains 1 million tokens. The model can still remember and reference information from previous interactions, as long as the total conversation stays within that 1 million token context window.
