Translations:Softmax Function/31/en: Difference between revisions
(Importing a new version from external source) |
(Importing a new version from external source) Tag: Manual revert |
||
| (2 intermediate revisions by the same user not shown) | |||
| Line 1: | Line 1: | ||
In practice, the softmax and cross-entropy are computed jointly for numerical stability (the '''log-softmax''' formulation), and the argmax at inference time can be applied directly to the logits without computing softmax at all. | In practice, the softmax and {{Term|categorical cross-entropy|cross-entropy}} are computed jointly for numerical stability (the '''log-softmax''' formulation), and the argmax at inference time can be applied directly to the {{Term|logits}} without computing softmax at all. | ||
Latest revision as of 23:34, 27 April 2026
In practice, the softmax and cross-entropy are computed jointly for numerical stability (the log-softmax formulation), and the argmax at inference time can be applied directly to the logits without computing softmax at all.