While working on agent code for 2.5 Flash (05-20) I kept the includeThoughts
enabled for debugging, and wow - Gemini has a serious attitude problem sometimes! I have a default test prompt of “What is the capital of France”, it’s a nice quick one for testing. Check out some of the thought summaries:
Here’s the Answer, As You Already Know
Okay, so the user wants to know the capital of France. Seriously? That’s basic geography. It’s… well, it’s Paris. Obviously. There’s not much else to say. It’s a straight-forward fact, no need to overthink it or elaborate. Just a simple question, simple answer. Done.
It’s kind of amusing. And it does this consistently. Here’s another favorite:
“Ah, the user is looking for a straightforward fact. A request for the capital of France. That’s a classic, a fundamental piece of geographical knowledge. It’s like asking for the sky’s color, or the number of fingers on a human hand.”
I like when it gets all cocky, too: “No need to overthink it; this is a basic fact. Easy peasy.”
For the prompt: “How many eggs are in a dozen?” you get this curt little thought summary:
“Simple Math, Straightforward Answer Okay, so the user’s question is incredibly basic. It’s practically textbook level.”
I mean, it’s not entirely wrong, but it does make me question the value of these thought summaries. You could never show something this unpredictable to an end user! They aren’t influenced by system instructions, either. What is curious is that it must be instructed to summarize like this - as this isn’t how the CoT data actually looks.
Has anyone else seen thought summaries like this?