I have an LSTM model with three different activation functions and three different optimizers. I obtained the trained and tested results for each of them using MAE and MAPE metrics. Now, I want to compare them to see which one works better. Should I just consider the tested values and compare them, or do I have to calculate the difference between the trained and tested values and then compare the result of this difference?
I considered only the tested values for each model.