Vertex fails to analyze .mov files (video/quicktime) containing apac spatial audio

Hi,

According to the official documentation, video/quicktime is listed as a supported MIME type for video understanding with Vertex AI models.

However, when analyzing a .mov video recorded on an iPhone 16 Pro Max using spatial audio (which is enabled by default), the API returns a generic HTTP 400 with "invalid argument" and no further details. This video includes an apac audio stream, as identified by FFmpeg.

If I switch the iPhone settings to record in stereo only - or re-encode the video using ffmpeg to remove the apac stream - the video is accepted and analyzed successfully.

I believe this is either a validation bug or an undocumented limitation.

Could you please clarify:

  • Is apac audio explicitly unsupported?
  • Should .mov files from iOS with spatial audio be preprocessed before being sent to Vertex?
  • Can the error response be improved to identify unsupported streams?

Thanks.

Hello,

Welcome to the Forum!!

May I ask which model you are using?

Hi @Lalit_Kumar

I believe I have the same issue.

I’m using Vertex API, Gemini 2.5 pro and flash both return the same cryptic 400 invalid argument.

Still an issue here in September. mov doesn’t work when trying to send videos from my iphone to the gemini api. If i convert to mp4, it works.

Docs still state mov as accepted.