In classical {{Term|gradient descent}}, the full gradient of the {{Term|loss function}} is computed over the entire training set before each parameter update. When the dataset is large, this becomes prohibitively expensive. SGD addresses the problem by estimating the gradient from a single randomly chosen sample (or a small '''{{Term|mini-batch}}''') at each step, trading a noisier estimate for dramatically lower per-iteration cost.
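Concretely, where full-batch gradient descent updates the parameters with <math>w_{t+1} = w_t - \eta \, \tfrac{1}{n} \sum_{i=1}^{n} \nabla \ell(w_t; x_i, y_i)</math>, SGD draws a random index <math>i_t</math> (or a random mini-batch) and uses <math>w_{t+1} = w_t - \eta \, \nabla \ell(w_t; x_{i_t}, y_{i_t})</math>, an unbiased but noisy estimate of the same gradient. The sketch below illustrates the trade-off on synthetic linear-regression data with squared loss; the data, step size, and function names are illustrative assumptions, not part of the original page.

<syntaxhighlight lang="python">
import numpy as np

rng = np.random.default_rng(0)

# Synthetic linear-regression data (illustrative assumption, not from the page):
# y = X @ w_true + noise, with n samples and d features.
n, d = 10_000, 5
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = X @ w_true + 0.1 * rng.normal(size=n)

def full_gradient(w):
    """Classical gradient descent: gradient of the squared loss
    (1/2n) * ||X w - y||^2 over the entire training set. Cost grows with n."""
    return X.T @ (X @ w - y) / n

def minibatch_gradient(w, batch_size=32):
    """SGD: the same gradient estimated from a random mini-batch.
    Noisier, but the per-step cost is independent of n."""
    idx = rng.choice(n, size=batch_size, replace=False)
    Xb, yb = X[idx], y[idx]
    return Xb.T @ (Xb @ w - yb) / batch_size

w = np.zeros(d)
eta = 0.1  # step size (illustrative choice)
for _ in range(500):
    w -= eta * minibatch_gradient(w)  # one cheap, noisy update per step

print("distance to w_true:", np.linalg.norm(w - w_true))
</syntaxhighlight>

Each mini-batch step touches only 32 rows of <code>X</code> rather than all 10,000, which is exactly the per-iteration saving the paragraph describes; the noise in the estimate averages out over many updates.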