Translations:Deep Residual Learning for Image Recognition/20/en
On CIFAR-10, the authors trained networks with over 1,000 layers, showing that extremely deep residual networks could still be optimized, though a 1202-layer network showed mild overfitting compared to a 110-layer variant due to the small dataset size.