Gemini 2.5 Pro sometimes outputs its internal thought process in the final response

I am using the Gemini 2.5 Pro API and have noticed that in some cases the model’s internal reasoning or “thought process” is included in the final output that is returned to the user.

From my understanding, the model's thinking should happen internally, and only the final answer should appear in the output unless includeThoughts is explicitly enabled in the request. In my implementation I am not setting includeThoughts, and I am only reading the main response.text, yet the output occasionally contains what looks like step-by-step reasoning or meta-commentary that should remain hidden.
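For concreteness, the request body I send looks roughly like this (a REST-style sketch as a Python dict; the prompt text is a placeholder). The point is that I never set thinkingConfig.includeThoughts, so it should stay at its default of off:

```python
# Sketch of the request body I send (REST-style JSON as a Python dict).
# The prompt text is a placeholder; note that there is no
# generationConfig.thinkingConfig.includeThoughts anywhere, so it stays
# at its default (off) and no thought summaries should come back.
request_body = {
    "contents": [
        {"role": "user", "parts": [{"text": "Summarise the attached article."}]}
    ],
    # No generationConfig.thinkingConfig in this request at all.
}

thinking_config = request_body.get("generationConfig", {}).get("thinkingConfig", {})
print(thinking_config.get("includeThoughts"))  # not set, so this prints None
```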

I have seen other developers report similar issues in community forums, so it seems to be a reproducible bug rather than an isolated case.

Could the team clarify under what circumstances this can happen, and whether there is a way to guarantee that internal thinking will never appear in the user-visible output?


The same thing happens with Gemini 2.5 Flash, which is devastating for the user experience.


@sunmoon1, @Sven_Yu

Welcome to the forum!

Thoughts are normally generated as output tokens; they help the model structure its reasoning for the task at hand, and they should not appear in the model's final output. That said, LLMs are not fully deterministic, so the safest approach is to explicitly instruct the model not to include any extra text, such as thoughts, in the actual output.

  1. Is this issue persistent across multiple tries? Also, can you share any prompts or cases where you were able to reproduce it?

  2. Have you tried any prompt engineering, such as asking the model to "avoid generating thoughts and give me a concise response"? That might help avoid the leakage.
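As a defensive workaround until this is resolved, you could also filter out any response part flagged as a thought on the client side before displaying the text. A minimal sketch (the Part stand-ins here are mocks of my own; the assumption is that in a real response, thought summaries are parts whose thought field is true, while plain answer parts leave it unset):

```python
from types import SimpleNamespace

def visible_text(parts):
    """Join the text of only those parts not flagged as thoughts.

    Assumes thought-summary parts carry a truthy `thought` field and
    ordinary answer parts do not set it.
    """
    return "".join(
        p.text
        for p in parts
        if getattr(p, "text", None) and not getattr(p, "thought", False)
    )

# Mock parts mimicking a response that leaked a thought summary.
parts = [
    SimpleNamespace(text="Thinking: first I will outline the answer...", thought=True),
    SimpleNamespace(text="Here is the summary you asked for."),
]
print(visible_text(parts))  # -> Here is the summary you asked for.
```

This does not fix the root cause, but it keeps stray reasoning text out of what your users see.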