Gemini 2.5 Flash Preview Native Audio Live API wrong turn detection

We often experience that the model stops speaking in the middle of its response. The log shows “Turn detected” although VAD is disabled. Any ideas? Is there a better way to force the model to finish it’s response or is the cascading model the more reliable approach?