Learn Before
Concept
Cross-stitch Networks
Cross-stitch networks start out with two separate model architectures just as in soft parameter sharing. They then use what they refer to as cross-stitch units to allow the model to determine in what way the task-specific networks leverage the knowledge of the other task by learning a linear combination of the output of the previous layers. Their architecture can be seen in the figure below, in which they only place cross-stitch units after pooling and fully-connected layers.

0
1
Updated 2022-05-26
Tags
Data Science