Hi, hopefully this is a common problem with a common-practice solution. In summary: what is the best way to preprocess images and train a neural network with TF/Keras such that GPU usage is optimized during training? Assume the image dataset does not fit into memory.
The current implementation uses ImageDataGenerators for on-the-fly preprocessing during model training, but the CPU is maxed out while the GPU sits mostly idle. Is this due to the ImageDataGenerators? If so, what is the typical way to get around it?
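For reference, here is roughly what the current setup looks like (the directory path, batch size, and class_mode are just placeholders, and `model` is defined elsewhere):

```python
from tensorflow.keras.preprocessing.image import ImageDataGenerator

# Rescale pixel values from [0, 255] to [0, 1] on the fly.
datagen = ImageDataGenerator(rescale=1.0 / 255)

# Read JPEGs from disk and resize each one to 150x150 as it is loaded.
train_gen = datagen.flow_from_directory(
    "train_dir",             # placeholder path
    target_size=(150, 150),
    batch_size=32,
    class_mode="binary",     # placeholder; depends on the task
)

model.fit(train_gen, epochs=10)
```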
The images are JPEGs and the preprocessing consists of resizing to 150 x 150 and dividing the pixel values by 255. One approach would be to do the preprocessing separately on the CPU and store the results to disk; that would be fine. But what would be the storage format, e.g. NumPy arrays in CSV files? And what would be the way to stream the preprocessed images into the fit() method so that GPU usage is optimized and not bottlenecked by the CPU, e.g. a tf.data.Dataset? A sketch of what I imagine that might look like is below.
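If tf.data is the right direction, is something like this what people mean? (Again, the directory name and batch size are placeholders and `model` is assumed to exist; I have not verified this fixes the bottleneck.)

```python
import tensorflow as tf

AUTOTUNE = tf.data.AUTOTUNE

# Load JPEGs from a placeholder directory, resizing to 150x150 at read time.
ds = tf.keras.utils.image_dataset_from_directory(
    "train_dir",             # placeholder path
    image_size=(150, 150),
    batch_size=32,
)

# Scale pixels to [0, 1] in parallel on the CPU, and prefetch batches so
# preprocessing overlaps with GPU training instead of blocking it.
ds = ds.map(lambda x, y: (x / 255.0, y), num_parallel_calls=AUTOTUNE)
ds = ds.prefetch(AUTOTUNE)

model.fit(ds, epochs=10)
```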
For what it is worth, this is all on Kaggle.
All thoughts are welcome - thanks!