Serving image models can be hard. One major problem is sending raw images (or raw pixel tensors) in the request payloads, which bloats them. Then there’s the problem of training/serving skew, where the preprocessing at inference time drifts from what the model saw during training.
In my latest blog post, I show how to locally deploy a ViT (Base-16) model from Transformers in a way that addresses both issues:
- We send the compressed image (e.g., its JPEG bytes) as a base64-encoded string rather than a raw pixel tensor, reducing the payload size considerably.
- We embed the preprocessing and postprocessing ops within the serving model itself to reduce training/serving discrepancy.
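To make the first point concrete, here is a minimal sketch of the client-side encoding. The `instances` / `image_bytes` / `b64` keys follow TF Serving's JSON convention; the exact payload shape your server expects may differ, so treat the field names as illustrative:

```python
import base64
import json


def make_payload(image_path: str) -> str:
    """Build a JSON request body from an image file on disk.

    We read the already-compressed bytes (JPEG/PNG) directly, so there is
    no need to decode the image into a large float tensor before sending.
    """
    with open(image_path, "rb") as f:
        raw = f.read()
    b64 = base64.b64encode(raw).decode("utf-8")
    # Hypothetical payload shape, modeled on TF Serving's b64 convention.
    return json.dumps({"instances": [{"image_bytes": {"b64": b64}}]})
```

Note that base64 by itself inflates data by roughly a third; the savings come from sending compressed image bytes instead of raw pixel values, with base64 only making those bytes JSON-safe.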
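And to illustrate the second point, a framework-agnostic sketch of what "embedding pre/post-processing in the serving model" means: the wrapper class, constants, and stand-in model below are all hypothetical, but the idea is that the same normalization and label-mapping code ships with the model instead of being reimplemented by each client:

```python
from typing import Callable, List

# Illustrative normalization constants (ViT processors often use mean = std = 0.5).
MEAN, STD = 0.5, 0.5


class ServingModel:
    """Wraps a core model so that preprocessing (scaling + normalization) and
    postprocessing (argmax + label lookup) travel with the model itself.
    Clients send raw pixel values and receive a label string back, which
    removes one common source of training/serving skew."""

    def __init__(self, core: Callable[[List[float]], List[float]], labels: List[str]):
        self.core = core          # the trained model, taken as a black box here
        self.labels = labels      # class-index -> human-readable label

    def preprocess(self, pixels: List[float]) -> List[float]:
        # Scale to [0, 1], then normalize exactly as done at training time.
        return [((p / 255.0) - MEAN) / STD for p in pixels]

    def postprocess(self, logits: List[float]) -> str:
        # Argmax over logits, mapped to its label.
        return self.labels[max(range(len(logits)), key=logits.__getitem__)]

    def __call__(self, pixels: List[float]) -> str:
        return self.postprocess(self.core(self.preprocess(pixels)))
```

In a real deployment these steps would live inside the exported model graph (e.g., a serving signature) rather than a Python class, but the contract is the same: raw inputs in, final predictions out.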
Next up, we’ll learn how to scale these kinds of deployments with Docker and Kubernetes. And if, like me, you’re a fan of truly serverless infra, there will be a piece on doing this with Vertex AI too. Stay tuned!