I’m planning a pilot; can you maybe give me an idea of when multimodal is going to be released? Not an exact date, but rather: will it be weeks? Months? Half a year?
Many thanks!
Hi @IKGN
Not sure exactly what you are asking for, but multimodal responses are already available, at least in the Gemini 2.0 Flash models.
See here: Generate images | Gemini API | Google AI for Developers
Cheers.
Text-and-image generation output has been released for experimental use. Try it out via Google AI Studio: just choose Gemini 2.0 Flash Experimental.
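If you'd rather hit the API directly than use AI Studio, here's a minimal sketch, assuming the `google-genai` Python SDK (`pip install google-genai`) and an API key in your environment. The model name and `response_modalities` field follow the "Generate images" doc linked above; since this is experimental, details may change.

```python
# Minimal sketch: text + image output from Gemini 2.0 Flash Experimental.
# Assumes the google-genai SDK and a GEMINI_API_KEY environment variable.
from io import BytesIO

from google import genai
from google.genai import types
from PIL import Image

client = genai.Client()  # picks up the API key from the environment

response = client.models.generate_content(
    model="gemini-2.0-flash-exp",
    contents="Draw a watercolor lighthouse and describe the scene.",
    config=types.GenerateContentConfig(
        # Request interleaved text and image parts in the response.
        response_modalities=["TEXT", "IMAGE"],
    ),
)

# The response interleaves text parts and inline image data.
for part in response.candidates[0].content.parts:
    if part.text is not None:
        print(part.text)
    elif part.inline_data is not None:
        Image.open(BytesIO(part.inline_data.data)).save("lighthouse.png")
```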
Sorry, I meant the Multimodal Live API, i.e. voice-to-voice: Multimodal Live API | Gemini API | Google AI for Developers. That’s still experimental and restrictions apply (3 concurrent sessions per API key, etc.). I’ve been trying it out for three months now and would like to scale up a bit.
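For context, this is roughly the kind of session I’m running, as a minimal sketch with the `google-genai` SDK’s async client. The session methods have shifted between experimental releases, so treat the exact calls below as illustrative rather than authoritative (real voice-to-voice would stream microphone audio in; I’m sending text and receiving audio here for brevity).

```python
# Minimal sketch of a Multimodal Live API session (experimental).
# Assumes the google-genai SDK and a GEMINI_API_KEY environment variable.
import asyncio

from google import genai

client = genai.Client()  # picks up the API key from the environment

MODEL = "gemini-2.0-flash-exp"
CONFIG = {"response_modalities": ["AUDIO"]}  # ask for spoken responses

async def main():
    # Opens a bidirectional websocket session; this counts toward the
    # 3-concurrent-sessions-per-key limit mentioned above.
    async with client.aio.live.connect(model=MODEL, config=CONFIG) as session:
        await session.send(input="Hello, can you hear me?", end_of_turn=True)
        async for response in session.receive():
            if response.data is not None:  # raw audio bytes from the model
                print(f"received {len(response.data)} bytes of audio")

asyncio.run(main())
```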