Translations:Recurrent Neural Networks/12/en
Training an RNN requires computing gradients of the loss with respect to the shared weights. backpropagation through time (BPTT) "unrolls" the RNN across time steps, producing a deep feedforward network with shared weights, and then applies standard backpropagation.