Can I generate Images or audios in Google AI Studio?

realsanjeev · June 29, 2024, 1:06am

I know there is model available for image generation in API?

I wanna know if there is feature in google AI studio to generate the image? I would like to experiment with prompt for generating more realistic images.

Also I would like to know if google has API that takes audio as input and audio as output. I am aware of fact we can use other model for audio generation(just like wispher) after getting text response from gemini, but there is large latency which is not suitable for chat like system.

Thank you.

afirstenberg · June 29, 2024, 2:35pm

No. Access to the Imagen models isn’t through the AI Studio API. You’ll need to use the Vertex AI API.

While you can do audio input with Gemini, it doesn’t do audio output. You’ll need to use the Google Text-to-Speech (TTS) API for that.

The largest latency I tend to notice is in the LLM portion itself, not in the STT or TTS portions. What latency numbers are you getting for each?

Topic		Replies	Views
Cannot generate images in Google AI Studio Google AI Studio gemini-15 , models , gemini-20	4	1467	June 12, 2025
Can Gemini API produce text to Image Gemini API gemini-15	2	250	June 23, 2024
Hi everyone how to use gemini ai for creating images through api Google AI Studio gemini-api	1	185	February 26, 2025
AI endpoint for Image generation which allows image+prompt -> image Gemini API api , gemini-api	6	188	October 7, 2024
Google AI studio - Image stopped being created Gemini API help_request	5	534	May 16, 2025

Can I generate Images or audios in Google AI Studio?

Related topics