AI Studio App > Production Ready

Hello everyone, I’ve got an AI voice app that I’m looking to deploy in a production-ready environment as an MVP. We’re got a basic version in AI studio, but we’re looking to move into using Vertex AI API for voice and sorting the app intro front-end / back-end infrastructure in Cloud run.

But to be honest I’m having a massive headache. I’m just wondering if anyone has any advice or ideas on just deploying this? I’ve been wracking my head for ages.

Cheers,
Probably a Fool

To deploy the AI voice app from AI Studio we can use Vertex AI and Cloud Run.
The key steps for it would be:

  1. Switch from AI Studio to Vertex AI endpoints for text-to-speech or multimodal calls:
    Generative AI on Vertex AI  |  Google Cloud

  2. Backend Deployment via Cloud Run: Containerize the backend (e.g., Flask, FastAPI, Node.js) and deploy to Cloud Run:
    Quickstarts  |  Cloud Run Documentation  |  Google Cloud
    (*Need to set up authentication and pass your Vertex AI API key securely.)

  3. Frontend Hosting: If we are exposing a UI, host it using Firebase Hosting or Cloud Run with a static frontend: Firebase Hosting
    or reuse Cloud Run for both frontend/backend.

  4. Security & Monitoring Setup: Use Cloud IAM to restrict API access: Identity and Access Management documentation  |  IAM Documentation  |  Google Cloud
    Set up Cloud Logging/Monitoring: https://cloud.google.com/monitoring
    Check quotas here: https://console.cloud.google.com/iam-admin/quotas