Quick fixes regarding the 'Thinking' model

The thinking model, unfortunately, has very strange priorities. When working with code, I fully understand that the model will not always succeed. When I, as a user, tell it what it did wrong, the model exceptionally falls into a mania of apologizing. You can trace its reasoning, and it devotes most of it to apologizing to the user and figuring out how to do it in a wonderful way. This is extremely annoying and unprofessional. I have the impression that DeepSeek3 doesn’t do this, or at least not to such an extent that it becomes particularly irritating.

If the model devotes a lot of attention to apologizing, it is even less likely to correct the parts where it made mistakes.
I don’t know if other users need this, but personally, at least when working with code, the model should be 100% focused on the code. Without praising the user for good ideas, and without apologizing when it makes mistakes.

6 Likes

You are not alone in thinking the models are overdoing it with apologies. Here’s another post with the same observation - How to stop Gemini being offensively obsequious?

The thinking model just makes the trained behavior more transparent, all of them do it. And to your point, using a significant portion of the thinking budget figuring out how to apologize doesn’t help with writing better code, which ultimately is what developers care about.

2 Likes

Oh, thanks for the link. It seems I wasn’t the only one who felt that way. One can only hope that the next version will have this aspect reduced by default.

Do you remember how bing argued with users? and if he didn’t like something, was he offended and ended the dialogue? imao :rofl:

1 Like

did you set a system prompt? I mean, my guy doesn’t apologize at all lol, he is perfectly condescending :slight_smile:

1 Like

The instances where the model apologized excessively occurred when I used it without system instructions. I understand that these instructions will address this issue. I simply believe that the model should, by default, exhibit less of this behavior in technical contexts.