I am using gemini-3.1-flash-image-preview to generate images. The docs mention that thinking is enabled by default, but when I log the usage metadata, the thinking token count is zero. If I add a thinking_config with include_thoughts=True and thinking_budget=2048, I get the thoughts back in my response. How does this work? Does setting a thinking budget make the model think more and generate better images, or does the model think anyway and only return the thoughts when I set a budget and include_thoughts? Also, the docs don’t mention how thinking tokens are billed.
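For anyone who wants to reproduce the check, here is a minimal sketch of how the token counts and thought parts could be inspected. It assumes the response exposes `usage_metadata.thoughts_token_count` and a `thought` flag on each part, as in the google-genai Python SDK. The helper names `summarize_usage` and `split_parts` are hypothetical, and the objects below are made-up stand-ins rather than output from a real call.

```python
from types import SimpleNamespace


def summarize_usage(usage):
    """Collect token counts from a usage-metadata-like object.

    Missing or None fields are reported as 0, so an unset thinking
    count cannot be told apart from a true zero in this summary.
    """
    fields = (
        "prompt_token_count",
        "candidates_token_count",
        "thoughts_token_count",
        "total_token_count",
    )
    return {name: getattr(usage, name, None) or 0 for name in fields}


def split_parts(parts):
    """Separate thought parts from answer parts using the `thought` flag."""
    thoughts = [p for p in parts if getattr(p, "thought", False)]
    answer = [p for p in parts if not getattr(p, "thought", False)]
    return thoughts, answer


# Stand-in objects with illustrative values (NOT real API output).
# In a real call these would come from response.usage_metadata and
# response.candidates[0].content.parts.
usage = SimpleNamespace(
    prompt_token_count=12,
    candidates_token_count=1290,
    thoughts_token_count=None,
    total_token_count=1302,
)
parts = [
    SimpleNamespace(thought=True, text="plan the composition"),
    SimpleNamespace(thought=None, text="final caption"),
]

summary = summarize_usage(usage)
thoughts, answer = split_parts(parts)
print(summary)
print(len(thoughts), "thought part(s),", len(answer), "answer part(s)")
```

Comparing this summary for a request with and without the thinking_config would show whether the reported thinking count actually changes, or whether only the returned thought parts do.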