I have a multi-class classification problem, and the model I train gives a reasonable overall accuracy but fails on one or a few smaller classes (very low performance on those classes only). I think part of the reason is that I use regular cross-entropy as my loss function, so I am optimizing for overall accuracy and not enforcing minimum per-class performance in any way. Is there any loss function in Keras that can do that for me? In other words, how do I force my model to not just focus on overall accuracy but ensure reasonable performance in all classes?
Use a sigmoid activation on your output layer and binary_crossentropy as the loss function. softmax gives a probability distribution over the n classes and works well when the classes are mutually exclusive, whereas sigmoid provides an independent probability for each class.
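To make the difference concrete, here is a small sketch in plain Python (the logits are arbitrary illustrative values):

```python
import math

def softmax(logits):
    """Probability distribution over classes: the outputs always sum to 1."""
    exps = [math.exp(z) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def sigmoid(z):
    """Independent per-class probability: each output stands on its own."""
    return 1.0 / (1.0 + math.exp(-z))

logits = [2.0, 1.0, 0.5]
print(softmax(logits))               # these values sum to 1
print([sigmoid(z) for z in logits])  # these values need not sum to 1
```

With softmax, raising one class's probability necessarily lowers the others; with sigmoid, each class is scored independently.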
Check also whether these classes are imbalanced. Beyond that, you can also use a loss designed for imbalanced data.
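If the classes do turn out to be imbalanced, one common remedy in Keras is to pass per-class weights via the `class_weight` argument of `model.fit`. A minimal sketch of deriving balanced weights from the label counts (pure Python; the labels below are illustrative):

```python
from collections import Counter

def balanced_class_weights(labels):
    """Weight each class inversely to its frequency:
    weight_c = n_samples / (n_classes * count_c)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {c: n_samples / (n_classes * n) for c, n in counts.items()}

# e.g. a toy problem where class 8 is rare
labels = [0] * 100 + [1] * 100 + [8] * 10
weights = balanced_class_weights(labels)
# the rare class gets a larger weight; pass the dict as:
# model.fit(x, y, class_weight=weights)
```

This makes mistakes on the rare class cost proportionally more during training, without changing the loss function itself.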
Thanks but my classes are mutually exclusive so I don’t think this works. Each observation belongs to one class only (out of 9 classes). It’s just that the model works well on 8 classes and sacrifices performance in one or a few classes.
Thanks. I have tried re-weighting and it slightly helps the problematic classes but doesn’t solve the problem. I am thinking maybe this is an indication that the problematic classes have an underlying issue and are just difficult to differentiate from the rest but don’t know how to test this hypothesis.
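One way to probe that hypothesis is to look at per-class recall (and the confusion matrix) instead of overall accuracy, so you can see exactly which classes absorb the errors. A minimal pure-Python sketch (the label arrays are illustrative):

```python
from collections import defaultdict

def per_class_recall(y_true, y_pred):
    """Fraction of each class's samples that were predicted correctly."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    return {c: correct[c] / total[c] for c in total}

y_true = [0, 0, 0, 1, 1, 2, 2, 2]
y_pred = [0, 0, 1, 1, 1, 0, 0, 2]
print(per_class_recall(y_true, y_pred))
```

If one class has much lower recall and its errors concentrate on a single other class, that is evidence the two are genuinely hard to differentiate rather than just under-weighted.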
You can try checking whether your model can overfit your data by training on your current train+test data combined.
Then you can check whether your training data points for these specific classes are representative enough of the test dataset for the same classes.
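As a first, cheap check on representativeness, you can compare the per-class proportions of the train and test splits; a large mismatch for the problematic classes is a warning sign. A small sketch (pure Python; the label lists are illustrative):

```python
from collections import Counter

def class_proportions(labels):
    """Fraction of the dataset belonging to each class."""
    counts = Counter(labels)
    n = len(labels)
    return {c: counts[c] / n for c in counts}

train_y = [0] * 90 + [1] * 90 + [8] * 5
test_y = [0] * 30 + [1] * 30 + [8] * 15
train_p = class_proportions(train_y)
test_p = class_proportions(test_y)
for c in sorted(set(train_p) | set(test_p)):
    print(c, train_p.get(c, 0.0), test_p.get(c, 0.0))
```

Matching proportions do not guarantee the train examples cover the same feature-space region as the test examples, but a mismatch here is easy to detect and fix first.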