Learn Before
Adding Layers During Training in Residual Networks
A significant practical advantage of residual networks is the ability to dynamically add layers during the training process. Because the default behavior of a residual block is to let data pass through unchanged via the identity mapping, new layers initialized as identity functions can be inserted without disrupting the already learned representations. This technique can actively accelerate the training phase of very large neural networks.
0
1
Tags
D2L
Dive into Deep Learning @ D2L
Related
Recent Variants of ResNets
Advantages of ResNets
Plain vs. ResNets Convolutional Neural Network Architectures
Evaluate ResNet at different depths for ImageNet Classification
Evaluate ResNet models with other state-of-the-art models for ImageNet Classification
Shortcut’s technique for identity mapping
Deep Residual Learning for Image Recognition
Residual Mapping
ResNet Initial Layers
Highway Networks vs. Residual Networks
Influence of Residual Connections on Subsequent Architectures
Adding Layers During Training in Residual Networks
Accelerated Forward Propagation in Residual Networks