Questions about Gemini 2.5 Flash Pricing and Thinking Budget in Vertex AI

Hao_Liu · April 18, 2025, 9:44am

Hi everyone,

I’m currently working with Gemini 2.5 Flash on Vertex AI, and I’ve noticed something puzzling regarding pricing and budget control. I would really appreciate it if someone could help clarify the following two points:

1. Reasoning vs No Reasoning Pricing

I noticed a big price difference between “no reasoning” and “reasoning” modes.

Does this mean the model charges separately for internal thinking and output?
Can I use thinking_budget to control the cost of reasoning?

2. Thinking Budget on Vertex AI

Is the thinking_budget option available when calling Gemini 2.5 Flash via Vertex AI ?
If yes, I’d really appreciate an example of how to use it with Vertex AI.

Appreciate any help or examples. Thanks!

Best,
Liu

Siva_Sravana_Kumar_N · May 18, 2025, 8:04pm

Hi @Hao_Liu,

Welcome to forum, thank you for filing the question. I hope now you have toggle for thinking in ai studio and as well in vertex ai.

Thank you.

Topic		Replies	Views
Pricing for Gemini 2.5 API: With and Without Thinking Option in the Official Release Gemini API billing , thinking , gemini-2-5	5	365	July 18, 2025
Are the thinking tokens counted in the output price for 2.5 Flash? Gemini API thinking , gemini-2-5	1	190	June 13, 2025
Gemini-2.5-flash-preview-04-17 not honoring thinking_budget=0 Gemini API help_request	5	1456	April 22, 2025
Correct prices for Gemini API models? Gemini API gemini-flash , billing	1	56	September 17, 2025
Does Gemini 2.5 Flash use different models for reasoning vs. non-reasoning outputs? Gemini API models , thinking , gemini-2-5	2	140	June 24, 2025

Questions about Gemini 2.5 Flash Pricing and Thinking Budget in Vertex AI

1. Reasoning vs No Reasoning Pricing

2. Thinking Budget on Vertex AI

Related topics