Is there a way to decrease the response time from Gemini 1.5 Pro? I am sending the same image and the same prompt multiple times, and the response generation time varies widely from one request to the next.
The response time of Gemini-1.5-Pro is generally higher because it is a more complex and resource-intensive model. If you prefer faster responses, consider using Gemini-1.5-Flash.
The variation you see between identical requests can be influenced by server availability at the time of each request.
If you are consistently using the same image or prompt in your requests, you may want to explore context caching. Here is the link.
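Before switching models or adding caching, it may help to quantify how much the latency actually varies. Below is a minimal sketch, not an official tool: `measure_latency`, `summarize`, and `fake_request` are hypothetical names, and `fake_request` is only a stand-in that you would replace with your own function that sends the image and prompt to Gemini.

```python
import statistics
import time
from typing import Callable, Dict, List


def measure_latency(call: Callable[[], object], runs: int = 10) -> List[float]:
    """Invoke `call` `runs` times and return each call's wall-clock duration in seconds."""
    durations = []
    for _ in range(runs):
        start = time.perf_counter()
        call()
        durations.append(time.perf_counter() - start)
    return durations


def summarize(durations: List[float]) -> Dict[str, float]:
    """Return min, median, max, and sample standard deviation of the durations."""
    ordered = sorted(durations)
    return {
        "min": ordered[0],
        "median": statistics.median(ordered),
        "max": ordered[-1],
        # The sample standard deviation needs at least two data points.
        "stdev": statistics.stdev(ordered) if len(ordered) > 1 else 0.0,
    }


def fake_request() -> str:
    # Hypothetical stand-in (assumption): replace the body with your real
    # Gemini request that sends the same image and prompt.
    time.sleep(0.01)
    return "ok"


if __name__ == "__main__":
    print(summarize(measure_latency(fake_request, runs=5)))
```

Running the same measurement against Gemini-1.5-Pro and Gemini-1.5-Flash, or with and without context caching, should show whether a change reduces the median or only the spread.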
Thanks