Gemini-3-flash-preview: Truncated/Garbage Output, Hallucination, and Incomplete Tool Calls in Production Testing

I’m testing gemini-3-flash-preview for a customer support AI agent (accessed via Google Generative AI API through n8n). I hit several issues that make it unusable for production.

Main issue: Garbage final output

In a multi-step agentic workflow, the model generated a correct response in one execution run (289 completion tokens). On the very next run with the same setup, it returned just `**` (two asterisks) as its final output, with only 3 completion tokens. No error, just garbage. This looks similar to issues reported in the gemini-cli GitHub repo (#10665, #7851) about empty responses and invalid chunks.

Other issues I encountered:

  1. Hallucinated data - The model made up a phone number that doesn’t exist. For customer support, this is a dealbreaker.

  2. Skipped tool calls - Sometimes the model skips tools it should call, jumping straight to a response without retrieving the data it needs.
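For the skipped tool calls specifically, the public Gemini API exposes a function-calling mode that forces the model to call a tool rather than answer directly; I haven't verified whether n8n's Gemini Chat Model node surfaces this setting. The request fragment (the function name here is a placeholder for my own tools) looks roughly like:

```json
{
  "tool_config": {
    "function_calling_config": {
      "mode": "ANY",
      "allowed_function_names": ["lookup_customer"]
    }
  }
}
```

Per the API docs, `"AUTO"` is the default (the model decides whether to call a function), while `"ANY"` forces it to call one of the allowed functions.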

What I tried:

  • Temperature at 1.0 (per docs)

  • Multiple test runs

Happy to share logs if useful.
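One workaround I'm considering while this is investigated: a small sanity check between the agent and the WhatsApp send, so obviously broken outputs like `**` trigger a retry instead of reaching the customer. A minimal sketch (the helper names, threshold, and fallback text are my own, not anything from n8n or the Gemini SDK):

```python
def looks_like_garbage(text: str, min_chars: int = 10) -> bool:
    """Heuristic: flag outputs that are empty or just stray markdown punctuation."""
    # Strip common markdown tokens the model sometimes emits alone ("**", "*", "```")
    cleaned = text.strip().strip("*`#_- \n")
    return len(cleaned) < min_chars

def reply_with_retry(generate, max_retries: int = 3,
                     fallback: str = "Sorry, something went wrong. Please try again.") -> str:
    """Call the model (via any zero-arg callable) and retry when the output looks broken."""
    for _ in range(max_retries):
        text = generate()
        if not looks_like_garbage(text):
            return text
    return fallback

# The failure case from this thread is caught, a normal reply passes:
assert looks_like_garbage("**")
assert not looks_like_garbage("Hi! Your order #1234 ships tomorrow.")
```

This obviously doesn't fix the underlying model behavior, but it keeps the `**` output from ever being sent to a customer.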


Hi @dretana, thanks for reaching out to us.

To help us diagnose this issue, could you share a bit more detail about how the final response is being generated, along with a screenshot of any relevant output or logs?

Hello! Thanks for looking into this. Here’s more context:

Setup:

  • n8n workflow with an AI Agent node running gemini-3-flash-preview through the Google Gemini Chat Model node (Google Generative AI API)

  • Agent has tools connected (for retrieving customer data, knowledge base lookups, etc.)

  • WhatsApp Business integration receives customer messages and triggers the workflow

  • Agent response is sent back to customer via WhatsApp

What happened: The workflow executed 3 runs for a single customer query:

  • Run 1: Tool calls executed

  • Run 2: Model generated a correct, complete response (289 tokens)

  • Run 3: Model output just `**` (3 tokens) - this is what got sent to the customer

The problem is that the workflow forwarded the output of Run 3 to the customer instead of the complete response from Run 2.

Attaching:

  1. Screenshot of the WhatsApp conversation showing the `**` output received by the customer

  2. Screenshot of n8n execution showing the 3 runs and their outputs