Model training fails with generators

Hello,

I have a model that I can train with a plain model.fit() call. Everything works well: training and validation accuracy rise as expected.

The problem is that when I switch to generators, the model is unable to train: accuracy just oscillates between a few fixed low values.

I do not change the data or the model in any way. I just add the generators like this:

import numpy as np
from tensorflow.keras.utils import Sequence

class DataGenerator(Sequence):
    def __init__(self, x, y, batch_size):
        self.x = x
        self.y = y
        self.batch_size = batch_size
        self.num_samples = x.shape[0]
        # Drop the last partial batch
        self.num_batches = self.num_samples // self.batch_size

    def __len__(self):
        return self.num_batches

    def __getitem__(self, index):
        # Slice the next contiguous batch of samples
        start_idx = index * self.batch_size
        end_idx = (index + 1) * self.batch_size

        batch_x = self.x[start_idx:end_idx]
        batch_y = self.y[start_idx:end_idx]

        return batch_x, batch_y

train_generator = DataGenerator(train_x, train_y, 32)
val_generator = DataGenerator(test_x, test_y, 32)

mymodel.fit(train_generator, epochs=num_epoch, steps_per_epoch=len(train_generator), validation_data=val_generator, validation_steps=len(val_generator), callbacks=[model_checkpoint_callback, custom_print_samples])

Whereas this works:
mymodel.fit(train_x, train_y, epochs=999, batch_size=32, verbose=2, validation_data=(test_x, test_y), callbacks = [model_checkpoint_callback, custom_print_samples])

I know there used to be a bug like this in the fit_generator function. Could that old bug also affect model.fit when it is used with generators?

Tensorflow version: 2.12.1

Hi @aa_bb, in your DataGenerator the batches are built from fixed contiguous slices, so the model sees the samples in exactly the same order every epoch, which can hurt training. When you pass the arrays to model.fit directly, the training data is shuffled automatically before each epoch, but a custom generator has to do its own shuffling. Could you please try shuffling the data inside the generator? Thank you.
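A minimal sketch of that suggestion. It uses a plain base class so the snippet runs standalone; in real code you would keep subclassing tensorflow.keras.utils.Sequence, which calls on_epoch_end automatically after every epoch. ShuffledDataGenerator and the seed parameter are illustrative names, not part of the original post:

```python
import numpy as np

class ShuffledDataGenerator:  # in real code: class ShuffledDataGenerator(Sequence)
    def __init__(self, x, y, batch_size, seed=0):
        self.x = x
        self.y = y
        self.batch_size = batch_size
        self.rng = np.random.default_rng(seed)
        # One shared permutation of sample indices, reshuffled every epoch,
        # so x rows and y labels stay aligned.
        self.indices = np.arange(x.shape[0])
        self.rng.shuffle(self.indices)

    def __len__(self):
        # Drop the last partial batch, like the original generator
        return self.x.shape[0] // self.batch_size

    def __getitem__(self, index):
        # Select the batch through the shuffled index array instead of a
        # contiguous slice, so batch composition changes between epochs.
        batch_idx = self.indices[index * self.batch_size:(index + 1) * self.batch_size]
        return self.x[batch_idx], self.y[batch_idx]

    def on_epoch_end(self):
        # Keras calls this hook after each epoch for Sequence subclasses.
        self.rng.shuffle(self.indices)
```

A drop-in replacement would then be train_generator = ShuffledDataGenerator(train_x, train_y, 32), with the existing fit call left unchanged.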

Hello,
It sounds like using generators with model.fit() causes the accuracy fluctuations, while training on the arrays directly does not. Your generator implementation looks syntactically correct, but there may still be underlying issues. Ensure that the generator yields the data correctly and that each batch matches the model's expected input shape. Try debugging by inspecting a few batch outputs and experimenting with batch sizes or preprocessing. Additionally, verify that nothing else differs between the two runs, such as the training parameters.