I’ve been using the Gemini API via Google AI to work with Gemini 1.5 Pro, but I’ve noticed that responses are sometimes slow, regardless of the number of tokens. I’m planning to set up a chatbot for our website and recently learned that the Gemini API can also be used through Vertex AI. For a production environment, would it be better to use Vertex AI instead? Is it generally more stable than accessing the Gemini API directly through Google AI? Also, what are the other benefits of using Vertex AI over Google AI?