Can I generate Images or audios in Google AI Studio?

afirstenberg · June 29, 2024, 2:35pm

No. Access to the Imagen models isn’t through the AI Studio API. You’ll need to use the Vertex AI API.

While you can do audio input with Gemini, it doesn’t do audio output. You’ll need to use the Google Text-to-Speech (TTS) API for that.

The largest latency I tend to notice is in the LLM portion itself, not in the STT or TTS portions. What latency numbers are you getting for each?

Topic		Replies	Views
Cannot generate images in Google AI Studio Google AI Studio gemini-15 , models , gemini-20	4	1290	June 12, 2025
Can Gemini API produce text to Image Gemini API gemini-15	2	247	June 23, 2024
Hi everyone how to use gemini ai for creating images through api Google AI Studio gemini-api	1	181	February 26, 2025
AI endpoint for Image generation which allows image+prompt -> image Gemini API api , gemini-api	6	174	October 7, 2024
Google AI studio - Image stopped being created Gemini API help_request	5	405	May 16, 2025