Categorical Crossentropy Label Smoothing

coco · April 2, 2024, 2:02pm

Hello,

I was trying to figure out what the label_smoothing parameter did for the loss “Categorical Crossentropy” and looking at the code, I came across this (keras/keras/losses/losses.py at v3.1.1 · keras-team/keras · GitHub):

if label_smoothing:
        num_classes = ops.cast(ops.shape(y_true)[-1], y_pred.dtype)
        y_true = y_true * (1.0 - label_smoothing) + (
            label_smoothing / num_classes
        )

The calculation of num_classes assumes that the classes are located on the -1 axis, but the categorical_crossentropy function takes “axis” as a parameter in order to know which axis corresponds to the classes.
I don’t understand why we don’t just use :
num_classes = ops.cast(ops.shape(y_true)[axis], y_pred.dtype)

Is there something I’ve misunderstood that explains this, or is it an error?

aniruthraj · November 6, 2024, 10:25am

Hi @coco,

Sorry for the delay in response.

num_classes = ops.cast(ops.shape(y_true)[-1], y_pred.dtype)

This axis=-1 is the last dimension of y_true corresponds to the classes which is common in one-hot encoding and axis parameter in categorical crossentropy defines the axis for loss calculation, not the number of classes. As far as I’m aware, for label smoothing the number of classes should be determined from the dimension that corresponds to the classes of last axis, while using axis to calculate num_classes could be incorrect if axis doesn’t match the classes dimension that is why [-1] is used.

Hope this helps.Thank You.

Topic		Replies	Views
Managing dataset with unbalanced labels General Discussion datasets , keras , epoc	4	451	December 26, 2023
Data format for training General Discussion api , keras	4	720	March 14, 2023
AUC for multi-class classification General Discussion api , keras , help_request	3	1858	March 15, 2022
Unexpected value of binary_crossentropy loss function in classifier network with two outputs Keras models , help_request	2	645	August 1, 2022
Help with Loss Function Errors General Discussion keras , tfdata , help_request	5	1534	February 7, 2022

Categorical Crossentropy Label Smoothing

Related topics