Edit: gemini-1.5-pro-002
does work in europe-west3
now, but:
- My quota for gemini-1.5-pro in europe-west3 is 50 requests / minute.
- For
gemini-1.5-pro-001
this works out exactly - However, for
gemini-1.5-pro-002
it’s closer to 1 request / minute
The billing account is active but still has free usage credits. Could this be the reason for the lower limits with 002?
Apart from that, where can I see the quota and usage?
Original post:
I can’t use gemini-1.5-pro-002
in europe-west3
, but it works in all other regions.
This code fails:
import vertexai.generative_models as genai
model = genai.GenerativeModel("gemini-1.5-pro-002")
model.count_tokens('test')
with this exception:
_InactiveRpcError: <_InactiveRpcError of RPC that terminated with:
status = StatusCode.FAILED_PRECONDITION
details = "Project `********` is not allowed to use Publisher Model `projects/horama-dev/locations/europe-west3/publishers/google/models/gemini-1.5-pro-002`"
debug_error_string = "UNKNOWN:Error received from peer ipv4:********:443 {created_time:"2024-09-25T15:43:53.563304941+02:00", grpc_status:9, grpc_message:"Project `********` is not allowed to use Publisher Model `projects/********/locations/europe-west3/publishers/google/models/gemini-1.5-pro-002`"}"
I’ve tried using the same model from different projects/accounts/billing accounts, but I’m getting the same error. All other regions seem to work. What might be the issue here?