Translations:Recurrent Neural Networks/18/en
Conversely, when the spectral radius exceeds 1, gradients can grow exponentially — the exploding gradient problem. Exploding gradients are typically handled by gradient clipping (capping the gradient norm at a threshold), but vanishing gradients require architectural solutions.