Translations:Adam A Method for Stochastic Optimization/18/en

    From Marovi AI

    where $ \alpha $ is the step size (learning rate) and $ \epsilon $ is a small constant for numerical stability.