Different metrics in model.fit and model.predict with the same dataset

komo · June 19, 2023, 2:18am

ds_train is a dataset containing a single sample for an overfitting scenario.


history = model.fit(
    x=ds_train,
    epochs=cfg.epochs*1,
    validation_data=ds_val,
    verbose=1
)

Results in:

But metrics in model.evaluate(ds_train) right after model.fit() is much worse:

What can be the reason for such a difference?

Kiran_Sai_Ramineni · June 19, 2023, 8:51am

Hi @komo, The difference in results is due to model.fit calculates the values after the forward pass and then updates the weights via back-propagation then model.evaluate calculates weights based upon the updated weights in back-propagation. while using the model.fit you can use custom callbacks,For example:

class CustomMonitoring(keras.callbacks.Callback):
  def on_train_batch_end(self, batch, logs=None):
    loss, acc = self.model.evaluate(train_images, train_labels,batch_size=1024)
    print('For end batch {}, loss is {:7.2f} and acc is {}.'.format(batch, loss, acc))

to update weights at the end of each batch. so that you can get the same results when using model.fit and model.evaluate. Please refer to this gist for working code example. Thank You.

Topic		Replies	Views
KERAS model.fit training progress printout: training vs validation values Keras datasets , keras-model , training	1	187	June 11, 2024
Different Results for model.evaluate() compared to model() General Discussion models , help_request	9	4000	November 1, 2021
What is the approach for training and validation metrics evaluation while writing code from scratch? General Discussion api , keras , help_request , metrics	1	610	November 10, 2023
Discrepancy between results reported by TensorFlow model.evaluate and model.predict General Discussion api , keras , model	0	1058	August 1, 2022
Model.fit and model.predict on a single sample gives different results General Discussion models	0	394	June 19, 2023

Different metrics in model.fit and model.predict with the same dataset

Related topics