I have seen code that uses a `log_softmax` activation in the final Dense layer together with `from_logits=True` in the cross-entropy loss, apparently to get a numerically stable softmax computation. How does this compare to using a linear activation in the Dense layer with `from_logits=True` in the loss? Isn't the softmax effectively applied twice in the first case, since the cross-entropy loss already performs the softmax calculation internally when `from_logits=True`?
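To make the comparison concrete, here is a minimal NumPy sketch of the two setups I am asking about (plain NumPy rather than Keras, with hand-written `softmax`/`log_softmax` helpers standing in for the layer activation and the loss's internal softmax):

```python
import numpy as np

def log_softmax(x):
    # Stable log-softmax: log_softmax(x) = x - logsumexp(x)
    x = x - x.max()
    return x - np.log(np.exp(x).sum())

def softmax(x):
    # Stable softmax, as applied internally by the loss when from_logits=True
    e = np.exp(x - x.max())
    return e / e.sum()

logits = np.array([2.0, 1.0, 0.1])  # example Dense-layer pre-activations

# Case 1: Dense activation = log_softmax, loss uses from_logits=True,
# so the loss applies softmax on top of log_softmax outputs.
p1 = softmax(log_softmax(logits))

# Case 2: Dense activation = linear, loss uses from_logits=True,
# so the loss applies softmax directly to the raw logits.
p2 = softmax(logits)

print(np.allclose(p1, p2))
```

Since `log_softmax` only shifts the logits by `logsumexp(x)` and softmax is shift-invariant, I would expect both cases to produce the same probabilities, which is what makes me wonder whether the extra `log_softmax` is redundant or serves another purpose.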