where $ \alpha $ is a small constant (commonly 0.1). This prevents the model from becoming overconfident, improves calibration, and often yields better generalization. It is standard practice in training large image classifiers and transformer models.
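As a minimal sketch of how label smoothing modifies the cross-entropy target (the function name and NumPy usage here are illustrative, not part of any particular library's API): the one-hot target is replaced by $ (1-\alpha)\,\text{one-hot} + \alpha / K $ before computing the loss.

<syntaxhighlight lang="python">
import numpy as np

def smoothed_cross_entropy(logits, target, alpha=0.1):
    """Cross-entropy with label smoothing over K classes (illustrative sketch)."""
    K = logits.shape[-1]
    # Numerically stable log-softmax
    z = logits - logits.max(axis=-1, keepdims=True)
    log_probs = z - np.log(np.exp(z).sum(axis=-1, keepdims=True))
    # Smoothed target: (1 - alpha) on the true class, alpha spread uniformly
    smooth = np.full(K, alpha / K)
    smooth[target] += 1.0 - alpha
    return -(smooth * log_probs).sum()

# Example: 3-class problem, true class 0
logits = np.array([2.0, 0.5, -1.0])
print(smoothed_cross_entropy(logits, target=0, alpha=0.1))
</syntaxhighlight>

With $ \alpha = 0 $ this reduces to the ordinary cross-entropy loss; larger $ \alpha $ pulls the target distribution toward uniform, which is what discourages overconfident predictions.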