Translations:Adam A Method for Stochastic Optimization/31/en
- Kingma, D. P. & Ba, J. (2015). Adam: A Method for Stochastic Optimization. Proceedings of ICLR 2015. arXiv:1412.6980
- Duchi, J., Hazan, E., & Singer, Y. (2011). Adaptive Subgradient Methods for Online Learning and Stochastic Optimization. JMLR 12.
- Loshchilov, I. & Hutter, F. (2019). Decoupled Weight Decay regularization. ICLR 2019. arXiv:1711.05101.