Severe Degradation in Gemini Flash 2.0 API Performance — Tool Use and Output Quality Affected

Hi everyone,

We’re currently experiencing major performance issues with the Gemini Flash 2.0 API, particularly affecting tool use reliability and output coherence in structured conversation flows.

We’re building a voice-based interview agent, and during a high-stakes filmed demo today, the agent failed to progress through its stages. Despite no code or infra changes on our end, the LLM stopped reliably calling the tools that drive stage logic and store candidate responses. This resulted in conversation loops, hallucinations, and general inconsistency, none of which were present in previous tests (two days prior).
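For context, here is a minimal sketch of the kind of setup we depend on, using the google-genai Python SDK with automatic function calling. The function names, prompts, and environment variable are hypothetical stand-ins, not our actual implementation:

```python
# Minimal sketch (hypothetical): an interview chat that relies on tool calls
# to advance stages and persist answers. Assumes the google-genai Python SDK
# (pip install google-genai) and GEMINI_API_KEY set in the environment.
import os
from google import genai
from google.genai import types


def advance_stage(next_stage: str) -> dict:
    """Move the interview to the next stage (hypothetical stand-in)."""
    print(f"advancing to stage: {next_stage}")
    return {"status": "ok", "stage": next_stage}


def store_answer(question: str, answer: str) -> dict:
    """Persist a candidate's answer (hypothetical stand-in)."""
    print(f"storing answer for: {question}")
    return {"status": "stored"}


client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])

# Passing plain Python callables as tools enables the SDK's automatic
# function calling, so the model is expected to invoke them to drive the flow.
chat = client.chats.create(
    model="gemini-2.0-flash",
    config=types.GenerateContentConfig(
        system_instruction=(
            "You are an interview agent. Use advance_stage to move between "
            "stages and store_answer to record every response."
        ),
        tools=[advance_stage, store_answer],
    ),
)

reply = chat.send_message("Hi, I'm ready to start the interview.")
print(reply.text)
```

When the model stops emitting these tool calls, the agent has no way to progress or persist anything, which is exactly the failure mode we saw.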

We’ve since rerun our test suite and observed a clear degradation in quality, independent of any updates on our side. This aligns with past reports from the community suggesting instability around new model rollouts or internal infrastructure changes:

Reddit thread on Gemini 2.0 Flash failing evals

Google AI Forum discussion

One unconfirmed theory is that the recent free availability of Gemini 2.5 Pro may be triggering infra shifts or quantization on the Flash 2.0 side, degrading its output. If this is the case, we’d love to understand what’s happening behind the scenes and whether this is a temporary phase.

Would appreciate any clarity from the Gemini team or confirmation that this is being looked into — and if other developers are noticing similar issues, let’s compare notes.

Thanks!


Hi @Victor_Billaud,

Welcome to the Google AI Forum! :confetti_ball: :confetti_ball:

Apologies for the delay in response.

We have made several improvements to tool calling in the 2.5 family of models over the last couple of months. I would highly encourage switching to the 2.5 family.
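If it helps, switching is typically just a change of model id in the same call. A minimal sketch with the google-genai SDK, where the model id, prompt, and placeholder tool are illustrative rather than a prescribed setup:

```python
# Hypothetical sketch of the same tool-calling request against a 2.5 family
# model; only the model id changes. Assumes the google-genai Python SDK and
# GEMINI_API_KEY set in the environment.
import os
from google import genai
from google.genai import types


def store_answer(question: str, answer: str) -> dict:
    """Placeholder tool, standing in for your real storage function."""
    return {"status": "stored"}


client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])

response = client.models.generate_content(
    model="gemini-2.5-flash",  # was "gemini-2.0-flash"
    contents="Please record that the candidate's favourite language is Python.",
    config=types.GenerateContentConfig(tools=[store_answer]),
)
print(response.text)
```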

Also, do let me know if you are still running into issues.