Fine tuned model 500 server error rate limit

Luna_W · January 30, 2025, 8:46pm

When using the Python SDK to access the Gemini API, I get 500 InternalServerErrors when asynchronously sending more than ~15 requests per minute to a fine tuned model. The way that this error is triggered would make it seem like a rate limiting problem, but as far as I know fine tuned models have equivalent rate limits to the model they’re based on. Since I’m on a paid plan, and it’s based on Gemini 1.5 Flash, that limit should be 2000 requests per minute. The documentation around 500 errors isn’t helpful in the slightest and I can’t find anyone else who has this problem.

The inputs are very short, only a few words at most.

Siva_Sravana_Kumar_N · March 21, 2025, 6:28pm

Hi @Luna_W,

Welcome to forum, does the issue still persists, if still exists please let us know.

Thank you!

Topic		Replies	Views
Always getting InternalServerError Gemini API gemini-api , model	1	59	October 14, 2024
Getting 500s With Experimental Models Gemini API api	3	149	March 6, 2025
Persistent An internal error 500 Gemini API feedback , bug	3	447	June 15, 2025
Gemini-2.5-flash-preview-04-17 returning Internal Error 500 Gemini API gemini-flash	2	298	May 8, 2025
Is `gemini-2.5-flash-preview-04-17` no longer available? Gemini API gemini-flash	2	251	May 8, 2025

Fine tuned model 500 server error rate limit

Related topics