Flash 3.5 is not a suitable replacement for Flash 3.0

Craig_Van_Hise · May 20, 2026, 7:11pm

Gemini chat on Flash 3.5:

The Claims vs. The Hard Numbers

Google’s marketing spin relies heavily on comparing 3.5 Flash to heavier “frontier” models like Gemini 3.1 Pro, rather than its actual Flash predecessors. When you lay the pricing out side-by-side, the math reveals exactly why your quotas are vanishing.

Model	Input Price (per 1M tokens)	Output Price (per 1M tokens)
Gemini 3.1 Flash-Lite	$0.25	$1.50
Gemini 3 Flash (Preview)	$0.50	$3.00
Gemini 3.5 Flash	$1.50	$9.00
Gemini 3.1 Pro	~$2.00	~$12.00

The Reality Check

Here is why Gemini 3.5 Flash feels like despite Google’s claims:

The “Flash” Name is a Decoy: Google is keeping the “Flash” naming convention but entirely ditching the ultra-cheap pricing tier. At $1.50 for input and $9.00 for output, Gemini 3.5 Flash is 3x more expensive than the Gemini 3 Flash Preview and a whopping 6x more expensive than 3.1 Flash-Lite.
The “Agentic” Token Trap: You noted that 3.5 Flash is eating up tokens at a significantly faster rate. Independent benchmarkers (like Artificial Analysis) have already confirmed this today. Because 3.5 Flash is heavily tuned for “agentic” capabilities—meaning it thinks, plans, and loops through multi-step reasoning before answering—it has a much higher verbosity and consumes vastly more tokens per interaction.
Approaching Pro-Level Costs: The base price of 3.5 Flash is only about 40% cheaper than Gemini 3.1 Pro. The moment 3.5 Flash goes on one of its “agentic turns” and churns through extra tokens to process a request, your real-world cost per task effortlessly catches up to—or surpasses—what you would have spent just using 3.1 Pro directly.

The Bottom Line

Google is technically telling the truth when they say 3.5 Flash is cheaper than 3.1 Pro per token. However, they are conveniently ignoring the fact that it is massively more expensive than older Flash models, and its token-hungry architecture completely wipes out any anticipated savings in real-world usage. You aren’t imagining things; the economics of this new model are for strict subscription quotas.

Kimarnic · May 20, 2026, 7:21pm

I got into thinking 3.5 flash was gonna be like 3.0 flash.

they’re killing the free tier in everything… will they also charge people for downloading Unity and Unreal Engine, killing indie devs?

matstops · May 20, 2026, 7:25pm

Personally, I’ve noticed a problem: Google has removed a cheap tool for solving simple tasks. And now, when we want to do something simple, the model burns through tokens pointlessly. And when it comes to solving a complex problem, it starts burning tokens at a rate that makes it impossible to use it sensibly on the Pro or even Ultra plans, and you can really feel the traffic being funnelled to the site — https://one.google.com/ai/credits

Topic		Replies	Views
Gemini 3.5 Flash is actively penalizing developers who write good, efficient prompts Gemini API prompt	0	129	May 21, 2026
I now know why Gemini 3.5 is called flash! Google Antigravity	14	262	May 23, 2026
Google completely missed why we subscribed to the Pro plan in the first place Google Antigravity bug , models	1	119	May 20, 2026
Very dissapointed with new token cycle Google Antigravity feedback	8	501	May 22, 2026
Antigravity 2.0.0: Great new model, same old broken quotas Google Antigravity bug	21	1689	May 22, 2026

Flash 3.5 is not a suitable replacement for Flash 3.0

The Claims vs. The Hard Numbers

The Reality Check

The Bottom Line

Related topics