I have a question about Gemini Vision. According to the documentation, each uploaded image consumes approximately 258 tokens (though this may vary with image dimensions). Currently, vision input can be used with gemini-1.0-pro-vision, gemini-1.5-flash, gemini-1.5-pro, and gemini-2.0-flash.
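For what it's worth, here is a minimal sketch of how I've been checking the per-image token count empirically, assuming the Python google-generativeai SDK (the API key and file path are placeholders):

```python
import google.generativeai as genai
import PIL.Image

genai.configure(api_key="YOUR_API_KEY")  # placeholder key

# "math_problem.png" is a placeholder path for a sample image.
img = PIL.Image.open("math_problem.png")
model = genai.GenerativeModel("gemini-1.5-flash")

# count_tokens accepts the same content types as generate_content,
# so the reported total should reflect how the image is tokenized.
print(model.count_tokens([img]).total_tokens)
```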
In this case, are these 258 tokens billed at each model's own API pricing? If so, how do these models actually differ in vision understanding? Does gemini-2.0-flash have better vision understanding than gemini-1.5-pro?
gemini-2.0-flash-lite has also been added to the API. Does it support vision input as well? In my opinion, Google's documentation is thin on these details of the vision capabilities.
Specifically, I plan to use it for OCR on mathematical problems, and I want to decide which model would suit that best.
I couldn't find any benchmarks or resources comparing the models on this kind of task.
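For context, here is roughly the call I have in mind (just a sketch of my plan; the model choice, prompt, and file path are my own assumptions, not a recommendation):

```python
import google.generativeai as genai
import PIL.Image

genai.configure(api_key="YOUR_API_KEY")  # placeholder key

# Placeholder image of a math problem to transcribe.
img = PIL.Image.open("math_problem.png")
model = genai.GenerativeModel("gemini-2.0-flash")  # candidate model

# Ask the model to act as OCR and return the problem as LaTeX.
response = model.generate_content(
    ["Transcribe the mathematical problem in this image as LaTeX.", img]
)
print(response.text)
```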
Thank you for your support.