Gemini 2.0 Flash: Declining Tool Usage Reliability with Increasing Conversation Length (Vercel SDK)

Hey team,

Wanted to flag something we’re seeing with Gemini 2.0 Flash, specifically around tool usage in longer conversations.

We’ve noticed that the longer the chat goes on, the more hesitant - or just plain unwilling - the model gets about actually calling the 7 tools we’ve set up. We’re using the latest Vercel SDK, and the prompt was even built specifically for this model.

The weird thing is, even when we try to explicitly tell the model in the prompt or query to use a tool, it often just skips it after a certain number of turns.

What’s really throwing us is that we took the exact same prompt and setup and tested it with a competitor’s model, and it worked perfectly – consistently using the tools no matter how long the conversation got.

So, it feels like it’s something specific to how Gemini 2.0 Flash handles tool calling when the context grows.

Has anyone else run into this with 2.0 Flash and tools? Or have any ideas on what might be going on? It’s getting a bit frustrating for our use case.

1 Like

Hey @Francisco_Javier_Saa , Welcome to the forum.

Seems like a model quality issue. Have you tried using a more powerful model, like gemini-2.5-flash?

Hey @GUNAND_MAYANGLAMBAM,

Thank you so much for your response. Gemini 2.5 flash is not available yet using Vercel SDK, but is true that with the pro model the results are better, but is not the right model for this use case.

Do you have any information about when Gemini 2.5 flash will be available through this library?

Not sure, especially since the SDK isn’t from Google.