Ablation experiments showed that the combination of batch normalization with higher learning rates and the removal of dropout produced the best results.