Gemini-flash-latest on Vertex AI: Cannot determine actual model version + gemini-3-flash-preview quality regression

Jiawei_Cui · April 13, 2026, 4:03pm

Context

We are running a production conversational AI agent on Vertex AI (location=global) using Google ADK (Agents Development Kit). Our agent uses tool calling (function calling) heavily — it orchestrates vehicle search, result presentation, and follow-up suggestions through multiple sub-agents.

We have been using gemini-flash-latest as our default model and are very happy with its quality — especially its tool-calling reliability, response formatting, and natural language output style.

Problem 1: Cannot resolve what gemini-flash-latest points to

There is no API to determine the actual model version behind the gemini-flash-latest alias on Vertex AI.

We tried:

models.get("gemini-flash-latest") returns name: publishers/google/models/gemini-flash-latest, version: default. No resolved model info.
Response metadata model_version field just echoes gemini-flash-latest back.
REST API GET /v1/publishers/google/models/gemini-flash-latest returns 404 on global endpoint.

We then ran empirical side-by-side tests (temperature=0, no thinking, identical prompts) comparing gemini-flash-latest against known model versions:

Test (7 deterministic prompts)	Exact hash match with flash-latest
`gemini-2.5-flash`	5/7
`gemini-3-flash-preview`	4/7

Conclusion: gemini-flash-latest on Vertex AI (location=global) is not identical to either gemini-2.5-flash or gemini-3-flash-preview. It appears to be a distinct, unpinnable snapshot.

According to the changelog (Jan 21, 2026), gemini-flash-latest was switched to gemini-3-flash-preview on the Google AI API — but this does not appear to match Vertex AI behavior.

Problem 2: gemini-3-flash-preview quality regression

We ran the same agent pipeline (ADK, multi-agent with tool calling) across all three models:

Metric	gemini-flash-latest	gemini-2.5-flash	gemini-3-flash-preview
Tool calling	Correct	Correct	Correct
Output tokens	362	292	283
Response quality	Natural, well-formatted	Good, slightly different style	Less polished
Bold formatting	Appropriate emphasis	Phrase-level bold	Minimal formatting

In our broader testing, gemini-3-flash-preview shows less natural phrasing and more terse output that does not match our UX standards. We are not confident upgrading to it.

Requests

Provide an API to resolve model aliases — models.get() or response metadata should return the actual underlying model version, not just echo the alias.
Clarify Vertex AI vs Google AI alias mapping — The changelog documents alias switches for the Google AI API. Does the same schedule apply to Vertex AI?
Provide a way to pin the current gemini-flash-latest snapshot — If the alias is about to change, we need the exact snapshot ID so we can pin before it rotates.

Related GitHub issue: `models.get()` does not resolve alias to underlying model version on Vertex AI · Issue #2271 · googleapis/python-genai · GitHub

Environment

Platform: Vertex AI
Location: global
SDK: google-genai (Python)
Framework: Google ADK (Agents Development Kit)

Topic		Replies	Views
Inconsistent Model Name: gemini-flash-latest Works in API but Dashboard Shows gemini-2.5-flash Gemini API ai-studio , bug , api , models , gemini-api	1	137	January 9, 2026
gemini-3.5-flash alias intermittently returns 404 (routes to decommissioned backing build gemini-v4p1s-rev24-ajax-sentinel) Gemini API models , gemini-flash	0	83	May 31, 2026
Downscaling degradation issue started happening today with NB2 Gemini API gemini-3	1	185	March 5, 2026
Severe Degradation in Gemini Flash 2.0 API Performance — Tool Use and Output Quality Affected Gemini API model-quality	1	562	August 7, 2025
Gemini-2.0-flash intelligence regression (and parallel tools?) Gemini API prompt , gemini-flash	1	192	June 4, 2025