Gemini Flash 2.0 Experimental VAD is pretty bad

I am impressed so far with the speed and capabilities of Flash 2.0 voice in/out (realtime) API. However it doesn’t seem to have very good voice activity detection, frequently interrupting the user. In addition, the voices are a little less natural-sounding (to my ear) than some competitors. Just providing feedback

1 Like