The published model alias gemini-3.5-flash on the Gemini Developer API (generativelanguage.googleapis.com) intermittently returns 404 NOT_FOUND for a large fraction of generateContent requests. The error names an internal backing build that does not exist:
404 NOT_FOUND
{
“error”: {
"code": 404,
"message": "Model not found: models/gemini-v4p1s-rev24-ajax-sentinel",
"status": "NOT_FOUND"
}
}
The same alias succeeds on the remaining requests, which strongly suggests gemini-3.5-flash is load-balancing across multiple backing builds and one of them (gemini-v4p1s-rev24-ajax-sentinel) is decommissioned/unhealthy and needs to be pulled from rotation.
Impact
Production traffic using gemini-3.5-flash fails roughly two-thirds of the time, intermittently and unpredictably. This is not retryable via normal backoff because it surfaces as a 404, not a 429/503.
Reproduction (minimal, library-independent)
Plain curl against the REST endpoint, minimal request body, no SDK involved:
for i in $(seq 1 25); do
curl -s “https://generativelanguage.googleapis.com/v1beta/models/gemini-3.5-flash:generateContent?key=$GOOGLE_API_KEY” \
-H 'Content-Type: application/json' \\
-d '{"contents":\[{"parts":\[{"text":"hi"}\]}\]}' \\
| grep -o '"message": "\[^"\]\*"' | head -1
done
Observed result: 17 of 25 calls failed (~68%), every failure identical: "Model not found: models/gemini-v4p1s-rev24-ajax-sentinel". The other 8 returned normal completions.
Key facts
-
gemini-3.5-flashis present inGET /v1beta/modelsand the API key has access to it. -
A direct
generateContentcall togemini-3.5-flashsucceeds part of the time, so this is not an access/permissions or model-name issue — it’s specific backing builds behind the alias failing. -
Sibling/related models are 100% stable in the same test (0 failures out of 12 calls each):
-
gemini-flash-latest -
gemini-3-flash-preview -
gemini-2.5-flash
-
-
Reproduced with the raw REST API (no client library), so it is not a
google-genai/langchain-google-genaiissue. Also reproduced through thegoogle-genaiPython SDK.
Environment
-
API: Gemini Developer API (
generativelanguage.googleapis.com),v1beta -
Auth: API key (AI Studio), not Vertex AI
-
Date/time observed: 2026-05-29, ~22:00 UTC
-
Request body: minimal, no generationConfig / thinkingConfig required to trigger it
Requests
-
Please pull the unhealthy backing build
gemini-v4p1s-rev24-ajax-sentinelout of thegemini-3.5-flashalias rotation. -
Confirm whether
gemini-3.5-flashis a recommended GA alias or should be avoided in favor ofgemini-flash-latest.