The GRU (Cho et al., 2014) simplifies the LSTM by merging the cell state and hidden state and using only two gates: