Hi,
I want to check whether XLA compilation can be applied when running inference with a loaded SavedModel graph. What I am doing is wrapping the inference call in @tf.function(jit_compile=True) after loading the SavedModel, and I am seeing a ~7x improvement in throughput.
Is this the correct way to do it, or am I doing something wrong? Please suggest.
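For reference, here is a minimal, self-contained sketch of what I mean. The tiny `Dense` module and the temporary export directory are placeholders standing in for my real model; the relevant part is wrapping the loaded SavedModel call in a @tf.function with jit_compile=True:

```python
import tempfile
import tensorflow as tf

# Hypothetical tiny model standing in for the real SavedModel.
class Dense(tf.Module):
    def __init__(self):
        self.w = tf.Variable(tf.random.normal([4, 2]))

    @tf.function(input_signature=[tf.TensorSpec([None, 4], tf.float32)])
    def __call__(self, x):
        return tf.matmul(x, self.w)

export_dir = tempfile.mkdtemp()
tf.saved_model.save(Dense(), export_dir)

loaded = tf.saved_model.load(export_dir)

# Wrap the loaded graph function so XLA compiles it on the first call.
@tf.function(jit_compile=True)
def infer(x):
    return loaded(x)

out = infer(tf.ones([8, 4]))
print(out.shape)
```

The first call triggers XLA compilation (so it is slow); subsequent calls with the same input shapes reuse the compiled executable, which is where the throughput gain shows up.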