I am impressed so far with the speed and capabilities of the Flash 2.0 voice in/out (realtime) API. However, its voice activity detection doesn’t seem very good: it frequently interrupts the user. In addition, the voices sound a little less natural (to my ear) than those of some competitors. Just providing feedback.