Hi.
I have a simple Keras model for MNIST that makes predictions and saves the loss. I am running on a server with multiple CPUs, so I want to use multiprocessing for a speedup.
I have successfully used multiprocessing with some basic functions, but for model prediction the processes never finish, while the non-multiprocessing approach works fine (see the sketch below).
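For reference, this is roughly the sequential version that works (a minimal sketch; it assumes `x_train`/`y_train` come from `tf.keras.datasets.mnist.load_data()` with one-hot labels, and that `model.h5` was saved beforehand):

```python
import numpy as np
import tensorflow as tf

# Assumed setup: MNIST data with one-hot labels and a trained, saved model
(x_train, y_train), _ = tf.keras.datasets.mnist.load_data()
y_train = tf.keras.utils.to_categorical(y_train, 10)

model = tf.keras.models.load_model('model.h5')
losses = {}
for idx in range(10):
    x = tf.convert_to_tensor(np.expand_dims(x_train[idx], axis=0))
    y = model(x)
    y_expanded = np.expand_dims(y_train[idx], axis=0)
    losses[idx] = tf.keras.losses.CategoricalCrossentropy()(y_expanded, y)
print(losses)
```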
I suspect that the issue is with the model: since there is a single model, it cannot be used in different parallel processes. So I loaded the model separately in each process, but that did not work either.
My multiprocessing code is this:
```python
from multiprocessing import Manager, Process

import numpy as np
import tensorflow as tf

# x_train / y_train are assumed to be defined as in the sketch above

# Make a prediction on one training sample and store its loss
def predict(idx, return_dict):
    x = tf.convert_to_tensor(np.expand_dims(x_train[idx], axis=0))
    local_model = tf.keras.models.load_model('model.h5')
    y = local_model(x)
    print('this never gets printed')
    y_expanded = np.expand_dims(y_train[idx], axis=0)
    loss = tf.keras.losses.CategoricalCrossentropy()(y_expanded, y)
    # Store a plain float so it can be shared through the Manager dict
    return_dict[idx] = float(loss)

manager = Manager()
return_dict = manager.dict()
jobs = []
for i in range(10):
    p = Process(target=predict, args=(i, return_dict))
    jobs.append(p)
    p.start()

for proc in jobs:
    proc.join()

print(return_dict.values())
```
The print line in the `predict` function is never reached, so the problem seems to be with the model. Even without loading the model inside the function and using a global one instead, the problem persisted.
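Concretely, the global-model variant I tried looks like this sketch (same setup as above; it hangs the same way):

```python
# Variant tried: one global model shared by all processes
model = tf.keras.models.load_model('model.h5')

def predict(idx, return_dict):
    x = tf.convert_to_tensor(np.expand_dims(x_train[idx], axis=0))
    y = model(x)  # this call also never returns in the child processes
    y_expanded = np.expand_dims(y_train[idx], axis=0)
    loss = tf.keras.losses.CategoricalCrossentropy()(y_expanded, y)
    return_dict[idx] = float(loss)
```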
I followed this thread, but it did not work. My questions are:
- How can I solve the model issue?
- Can I use the same `x_train` for all the processes?