Hi everyone,
I am experiencing some severe billing anomalies with the Gemini API and wanted to see if others are seeing similar patterns in their usage reports.
The Issues:
-
Billing for Unused Models:
I switched my application to use gemini-2.0-flash in January. I have confirmed logs showing that from Jan 7th to Jan 13th, we only sent requests to the 2.0 model.
However, my Google Cloud billing report shows a sudden spike on Jan 11th for gemini-2.5-flash (approx. 900 requests), resulting in a charge of ~ $17. A similar spike happened on January 3rd also. I am providing 2 screenshot of this issue for context- -
Anomalous Volume in December:
Prior to the switch, during December, I saw massive, sudden spikes in API requests for gemini-2.5-flash (ranging from 2,000 to 5,000+ requests). These spikes showed very large input token counts and did not match the actual traffic going to my application. My application does not make that much api request as shown in dashboard from AI studio. It costs more than $600 in just in December. Even in November there was some unexpected spikes in usage.
Summary:
Since the “phantom” spikes continued in January on a model version I was no longer using, it strongly suggests the high volume in December was also an anomaly rather than organic traffic which I predicted.
Has anyone else seen usage reported for model versions they aren’t calling, or sudden unexplained spikes in request counts?
How may I get a refund for this and where should I report this stuff?

