I am impressed so far with the speed and capabilities of the Flash 2.0 voice in/out (realtime) API. However, its voice activity detection doesn't seem very good: it frequently interrupts the user. In addition, the voices sound a little less natural (to my ear) than those of some competitors. Just providing feedback.