The authors also observed that batch normalization reduces the dependence on precise initialization, permits higher learning rates without divergence, and provides a mild regularization effect because each sample's normalized value depends on the other samples in its mini-batch, introducing stochastic noise.
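The mini-batch dependence can be made concrete with a small sketch. The following is a minimal NumPy illustration (not code from the paper; the function name <code>batch_norm_train</code> and all values are hypothetical): the same sample, normalized inside two different mini-batches, produces two different outputs, which is the stochastic noise behind the mild regularization effect.

<syntaxhighlight lang="python">
import numpy as np

def batch_norm_train(x, gamma, beta, eps=1e-5):
    """Training-mode batch norm for x of shape (batch_size, features).

    Normalization uses the mini-batch mean and variance, so each
    sample's output depends on the other samples in its mini-batch.
    """
    mu = x.mean(axis=0)                     # per-feature mini-batch mean
    var = x.var(axis=0)                     # per-feature mini-batch variance
    x_hat = (x - mu) / np.sqrt(var + eps)   # normalize to zero mean, unit variance
    return gamma * x_hat + beta             # learned scale and shift

rng = np.random.default_rng(0)
sample = rng.normal(size=(1, 4))            # one fixed sample
batch_a = np.vstack([sample, rng.normal(size=(31, 4))])  # two mini-batches that
batch_b = np.vstack([sample, rng.normal(size=(31, 4))])  # both contain the sample

gamma, beta = np.ones(4), np.zeros(4)
out_a = batch_norm_train(batch_a, gamma, beta)[0]
out_b = batch_norm_train(batch_b, gamma, beta)[0]

# The same sample normalizes differently depending on its mini-batch --
# the source of the stochastic noise noted above.
print(out_a)
print(out_b)
</syntaxhighlight>

At inference time this noise disappears, since the batch statistics are replaced by fixed population estimates accumulated during training.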