A couple of months back, we introduced full XLA support for the TensorFlow text generation models in Transformers. With XLA compatibility in place, text generation can run up to ~100x faster than before.
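The core of the speedup is compiling `generate` with XLA via `tf.function(jit_compile=True)` and keeping input shapes fixed so the compiled graph is reused. A minimal sketch of that pattern (the checkpoint name, padding length, and token counts here are illustrative):

```python
import tensorflow as tf
from transformers import AutoTokenizer, TFAutoModelForCausalLM

# gpt2 is used only as a small illustrative checkpoint.
tokenizer = AutoTokenizer.from_pretrained("gpt2", padding_side="left")
tokenizer.pad_token = tokenizer.eos_token  # gpt2 has no pad token by default
model = TFAutoModelForCausalLM.from_pretrained("gpt2")

# Compile generate with XLA: the first call traces and compiles,
# subsequent calls with the same input shapes reuse the compiled graph.
xla_generate = tf.function(model.generate, jit_compile=True)

# Padding to a fixed length keeps input shapes constant, avoiding recompilation.
inputs = tokenizer(["TensorFlow is"], padding="max_length", max_length=8,
                   return_tensors="tf")
out = xla_generate(**inputs, max_new_tokens=16)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```

Without the fixed-length padding, each new input length would trigger a fresh XLA compilation, erasing the benefit.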
We recently wrote a guest post with the TensorFlow team discussing the technical considerations that went into delivering this user experience.
Read all about it here: