I found that `HierarchicalCopyAllReduce` is much slower than `NcclAllReduce` when training on multiple GPUs (see the related report: Issue #971 on google/automl, GitHub). Any ideas why?
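For context, the two ops being compared are the cross-device all-reduce implementations you can pass to `tf.distribute.MirroredStrategy`. A minimal sketch of how one would switch between them (the `make_strategy` helper is hypothetical; the class names are the public `tf.distribute` API):

```python
import tensorflow as tf

def make_strategy(use_nccl: bool) -> tf.distribute.MirroredStrategy:
    """Build a MirroredStrategy with the chosen cross-device all-reduce.

    NcclAllReduce uses NVIDIA's NCCL kernels for GPU-to-GPU reduction,
    while HierarchicalCopyAllReduce reduces through a copy hierarchy,
    which can be slower on machines with fast GPU interconnects.
    """
    ops = (tf.distribute.NcclAllReduce()
           if use_nccl
           else tf.distribute.HierarchicalCopyAllReduce())
    return tf.distribute.MirroredStrategy(cross_device_ops=ops)
```

To reproduce the comparison, one would time the same training step under `make_strategy(True)` and `make_strategy(False)`; the relative speed depends on GPU count, tensor sizes, and the interconnect (NVLink vs. PCIe).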