Translations:Softmax Function/30/en: Difference between revisions
Revision as of 22:02, 27 April 2026
- A neural network produces raw logits $ \mathbf{z} $ from its final linear layer.
- Softmax converts logits to probabilities: $ \hat{\mathbf{y}} = \sigma(\mathbf{z}) $.
- The predicted class is $ \hat{c} = \arg\max_k \hat{y}_k $.
- Training uses Cross-Entropy Loss applied to the predicted distribution and the true labels.
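The pipeline above can be sketched in a few lines of NumPy. The logits `z` and the true class label are made-up example values; softmax is computed in the standard numerically stable form, $ \sigma_k(\mathbf{z}) = e^{z_k - \max_j z_j} / \sum_j e^{z_j - \max_j z_j} $, which gives the same result as the unshifted definition.

```python
import numpy as np

def softmax(z):
    # Subtract the max logit before exponentiating; this avoids
    # overflow and leaves the probabilities unchanged.
    e = np.exp(z - np.max(z))
    return e / e.sum()

# Hypothetical raw logits from a final linear layer (3 classes).
z = np.array([2.0, 1.0, 0.1])

y_hat = softmax(z)              # predicted distribution, sums to 1
c_hat = int(np.argmax(y_hat))   # predicted class  \hat{c}

# Cross-entropy loss against a one-hot true label (class 0 here):
# L = -log \hat{y}_{true}
true_class = 0
loss = -np.log(y_hat[true_class])
```

Note that subtracting `np.max(z)` makes the largest exponent exactly 0, so `np.exp` never overflows even for large logits.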