Hi @lafif
I believe the gemini model that offers the best balance of price and performance is Gemini 2.5 Flash lite, which provides low-latency and cost-effective solutions for high-throughput tasks .
For a slightly more capable model that is still very cost-efficient, Gemini 2.5 Flash is the ideal choice for large-scale, low-latency, high-volume tasks that require some thinking and agentic capabilities.
Thank you
@Pannaga_J Thanks for the suggestion. I checked the pricing, and it looks like Gemini 1.5 Flash-8B was still significantly cheaper for my use case. Input was around $0.0375 and output $0.15 per 1M tokens, while Gemini 2.5 Flash Lite is $0.10 input and $0.40 output.
The newer models seem faster and more capable, but for pure cost-efficiency, 1.5 Flash-8B still had a big advantage. Do you know if there’s any plan for a cheaper tier similar to 1.5 Flash-8B?