.tflite model - improve latency of Google Cloud bucket retrieval object for Inference

Rubens_Zimbres · July 25, 2024, 8:17pm

I'm following the Image Classification code at MediaPipe.

My model runs successfully and is smaller than EfficientNet. However, the inference latency I get is 5 seconds.

By contrast, the .tflite object hosted by MediaPipe at:

[https://storage.googleapis.com/mediapipe-models/image_classifier/efficientnet_lite0/float32/1/effici...](https://storage.googleapis.com/mediapipe-models/image_classifier/efficientnet_lite0/float32/1/efficientnet_lite0.tflite)

… loads instantly, with an inference time of milliseconds.

I tried a regional bucket with fine-grained permissions, and I also tried serving the model through its public URL (https://storage.googleapis.com/xxxxxxxx/model.tflite), but neither solved the problem. The bucket's CORS file is configured as follows:

[
  {
    "origin": ["https://your-example-website.appspot.com"],
    "method": ["GET"],
    "responseHeader": ["Content-Type"],
    "maxAgeSeconds": 1
  }
]

Do you have any ideas how to improve object retrieval latency to milliseconds?
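
To narrow down where the 5 seconds go, it may help to time the download from the bucket separately from model initialization. The following is only a minimal Python sketch under assumptions: `timed`, `fetch_bytes`, and `profile_model_load` are hypothetical helper names, and `build_classifier` stands for whatever function creates the MediaPipe classifier from the downloaded bytes.

```python
import time
import urllib.request
from typing import Callable, Dict


def timed(label: str, fn: Callable[[], object], timings: Dict[str, float]) -> object:
    """Run fn(), store its wall-clock duration in seconds under label, return its result."""
    start = time.perf_counter()
    result = fn()
    timings[label] = time.perf_counter() - start
    return result


def fetch_bytes(url: str) -> bytes:
    """Download the whole object at url into memory using the standard library."""
    with urllib.request.urlopen(url) as response:
        return response.read()


def profile_model_load(
    url: str,
    build_classifier: Callable[[bytes], object],
    fetch: Callable[[str], bytes] = fetch_bytes,
) -> Dict[str, float]:
    """Time the download and the classifier initialization as two separate stages.

    build_classifier is an assumption: any callable that accepts the raw model
    bytes and returns a ready-to-use classifier.
    """
    timings: Dict[str, float] = {}
    model_bytes = timed("download", lambda: fetch(url), timings)
    timed("init", lambda: build_classifier(model_bytes), timings)
    return timings
```

Calling `profile_model_load` once with the bucket URL and once with the MediaPipe-hosted URL would show whether the difference comes from the `download` stage or the `init` stage. If download dominates, the bucket or network path is the likely place to look; if it does not, the model itself may be the slower part.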
