Historically, with multi-turn conversations (and with other providers), I've passed the entire conversation history (all assistant, user, and tool outputs) back to the model when starting a new turn.
With Gemini, there are thought signatures (https://ai.google.dev/gemini-api/docs/thought-signatures), which suggest that some amount of memory and reasoning context is retained inside these opaque tokens.
I've noticed that I can omit the tool outputs from the history and Gemini can still answer questions about data returned in prior turns, presumably by referencing the thought signatures I sent back (although it does seem slower when doing so).
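For context, here's a minimal sketch of what I mean by trimming the history. It uses plain dicts in the REST-style content format; the field names (`role`, `parts`, `thoughtSignature`, `functionResponse`) follow the public docs, but the helper itself and the sample values are just mine for illustration, not SDK code:

```python
def strip_tool_outputs(history):
    """Drop functionResponse parts (tool outputs) from prior turns,
    keeping text/functionCall parts and any attached thought signatures."""
    trimmed = []
    for content in history:
        parts = [p for p in content["parts"] if "functionResponse" not in p]
        if parts:  # skip turns that contained only tool outputs
            trimmed.append({"role": content["role"], "parts": parts})
    return trimmed

# Hypothetical three-turn history: user question, model function call
# (carrying a thought signature), then the tool's response.
history = [
    {"role": "user", "parts": [{"text": "What's the weather in Paris?"}]},
    {"role": "model", "parts": [
        {"functionCall": {"name": "get_weather", "args": {"city": "Paris"}},
         "thoughtSignature": "opaque-token-from-api"},
    ]},
    {"role": "user", "parts": [
        {"functionResponse": {"name": "get_weather",
                              "response": {"temp_c": 18}}},
    ]},
]

trimmed = strip_tool_outputs(history)
print(len(trimmed))                                   # tool-output turn dropped
print("thoughtSignature" in trimmed[1]["parts"][0])   # signature preserved
```

So the signature on the model's function-call part is sent back verbatim, but the raw tool payload is not.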
Can anyone from the Google team recommend the correct approach here? Should I rely on thought signatures plus a memory tool, so the model can read from memory when it decides it needs to?
Are there any undocumented tradeoffs to relying on thought signatures this way?
Thanks!