Generative AI on Vertex AI

Jose_Alencar · March 23, 2026, 4:38am

I’m evaluating the Gemini Live API (via Vertex AI or Gemini API) for a fictional MVP prototype: real-time voice conversations (speech-to-speech) with Gemini.

Scenario details:

100 simultaneous/ concurrent users
Each user has about 1 hour of active conversation per day
Total estimated: ~180,000 active conversation minutes per month (assuming 30 days)
Using VAD (voice activity detection) so only speaking time is billed (no silence)
Likely using models like gemini-2.5-flash-live or similar for low-latency voice

Specific questions:

What is the effective per-minute cost for a full duplex voice conversation (input audio + output audio + processing)? Is it around $0.011–0.012/min as some docs/calculations suggest, or has it changed?
What are the current concurrency limits for Gemini Live API? (e.g., max simultaneous WebSocket connections per project/region — Tier 1/2/3?)
- Can it handle 100 concurrent live sessions reliably?
- Any extra charges or setup needed for higher concurrency?
How is billing calculated exactly for Live API? (tokens per second of audio? Input + output separately? Any flat fees?)
Are there any preview limitations, regional restrictions, or best practices for scaling voice agents to this level?
Any recommendations for partners/integrations (like Daily co) for web/mobile voice frontend?

This is for early prototyping/MVP budgeting — any guidance, updated pricing sheet, or quota increase path would be super helpful.

Topic		Replies	Views
Regarding Google Project ready Voice module Gemini API gemini-15 , ai-studio , api , vertexai , gemini	2	102	November 27, 2025
Could someone help me understand gemini live pricing? Gemini API api , models , billing	1	444	June 23, 2025
Gemini Live 2.5 token counting - what is the expected cost of long-running video session? Gemini API api , billing	3	187	November 1, 2025
Do I get charged for generated tokens if client disconnects during a Vertex AI streaming response? Gemini API vertexai , open-ai	4	238	June 26, 2025
Vertex AI Live API: only native-audio reachable in EU and it breaks on turn 2; cascade models return Gemini API audio	0	41	May 14, 2026

Generative AI on Vertex AI

Related topics