Translations:Softmax Function/10/en: Difference between revisions
(Importing a new version from external source) Tag: Manual revert |
(Importing a new version from external source) Tag: Manual revert |
||
| Line 1: | Line 1: | ||
As the gap between the largest logit and the others increases, the output approaches a one-hot vector. This "winner-take-most" behavior makes softmax well-suited for classification where a single class should dominate. | As the gap between the largest {{Term|logits|logit}} and the others increases, the output approaches a {{Term|one-hot encoding|one-hot}} vector. This "winner-take-most" behavior makes softmax well-suited for classification where a single class should dominate. | ||
Latest revision as of 23:34, 27 April 2026
As the gap between the largest logit and the others increases, the output approaches a one-hot vector. This "winner-take-most" behavior makes softmax well-suited for classification where a single class should dominate.