Which Gemini API Models Support top_k Sampling?

I’m trying to determine which Gemini API models support top_k sampling. The official Google AI API documentation states:

"For Top-k sampling.

Top-k sampling considers the set of topK most probable tokens. This value specifies default to be used by the backend while making the call to the model. If empty, indicates the model doesn’t use top-k sampling, and topK isn’t allowed as a generation parameter."

However, it’s unclear from the documentation which specific Gemini models allow the top_k parameter in generation requests.

Could anyone clarify which models currently support top_k, or if there is an alternative method for achieving similar behavior in models that do not? Thanks in advance!

Hi @user1365

Welcome to the forum.

Did you check the list of models returned by the API? Where a model supports it, the topK value is documented per model. Here’s an example entry from the models.list response:

    {
      "name": "models/gemini-2.0-flash-thinking-exp-1219",
      "version": "2.0",
      "displayName": "Gemini 2.0 Flash Thinking Experimental",
      "description": "Gemini 2.0 Flash Thinking Experimental",
      "inputTokenLimit": 1048576,
      "outputTokenLimit": 65536,
      "supportedGenerationMethods": [
        "generateContent",
        "countTokens"
      ],
      "temperature": 0.7,
      "topP": 0.95,
      "topK": 64,
      "maxTemperature": 2
    },
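If you’d rather check programmatically, here is a minimal sketch using the google-generativeai Python SDK (the API key is a placeholder you’d replace with your own): it walks the models list and reports which models advertise a topK default, which, per the documentation you quoted, is the signal that the parameter is accepted.

    import google.generativeai as genai

    genai.configure(api_key="YOUR_API_KEY")  # placeholder; substitute your own key

    # Walk the models list. Per the docs quoted above, a model whose
    # topK field is empty doesn't accept topK as a generation parameter.
    for model in genai.list_models():
        if "generateContent" in model.supported_generation_methods:
            if model.top_k:
                print(f"{model.name}: topK supported (default {model.top_k})")
            else:
                print(f"{model.name}: no topK support")

Once you’ve confirmed a model supports it, you can pass the parameter in the generation config, e.g. `generation_config=genai.GenerationConfig(top_k=64)` on a `generate_content` call.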

Cheers