Translations:Stochastic Gradient Descent/34/en
- Robbins, H. and Monro, S. (1951). "A Stochastic Approximation Method". Annals of Mathematical Statistics.
- Bottou, L. (2010). "Large-Scale Machine Learning with Stochastic Gradient Descent". COMPSTAT.
- Kingma, D. P. and Ba, J. (2015). "adam: A Method for Stochastic Optimization". ICLR.
- Ruder, S. (2016). "An overview of gradient descent optimization algorithms". arXiv:1609.04747.