Dear Guys,
I hope this message finds you well.
I am currently working with Gemini 1.5 Pro’s newest multi-model and am seeking a feature similar to the speech-to-text conversion available in the Vertex AI playground. While I have noticed that the current Gemini API examples perform inferencing in batches after uploading a file, my requirement is for real-time processing.
Could you please guide me on how to achieve real-time speech-to-text conversion using Gemini 1.5 Pro? Any assistance or direction you can provide would be greatly appreciated.
Thank you very much for your help.
Best regards,
Hung Truong