Translations:Softmax Function/14/en

    From Marovi AI
    Revision as of 00:30, 27 April 2026 by FuzzyBot (talk | contribs) (Importing a new version from external source)
    (diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
    • $ T \to 0 $: The distribution collapses to a one-hot vector selecting the argmax — equivalent to a hard decision.
    • $ T = 1 $: Standard softmax.
    • $ T \to \infty $: The distribution approaches uniform — all classes become equally likely.