I noticed that Gemini API returns errors consistently when the prompt contains audio data. This started happening on 2025-04-02 according to my API usage stats.
Error Messages
The error is one of these two:
- [GoogleGenerativeAI Error]: Error fetching from https://generativelanguage.googleapis.com/v1beta/models/gemini-1.5-flash-002:streamGenerateContent?alt=sse: [503 Service Unavailable] The model is overloaded. Please try again later.
- [GoogleGenerativeAI Error]: Error fetching from https://generativelanguage.googleapis.com/v1beta/models/gemini-1.5-flash-002:streamGenerateContent?alt=sse: [500 Internal Server Error] Unable to submit request because the service is temporarily unavailable.
Google Cloud API Metrics
In Google Cloud Console → APIs & Services → Generative Language API, as of writing this post the error rate is over 95% making the service almost unusable. Billing is turned on and I am using the paid API, not the free tier.
Gemini API Status
Despite the very high error rate, Gemini API’s status page says that “All Systems Operational.”
I suggest that Gemini team implement a separate status check for audio processing.