Concept
Scaling Model Capacity with Language-Specific Parameters
A layer whose parameters are split by language or by language group, where groups are formed by vocabulary similarity.
Each translation direction accesses only a subset of these parameters, so model capacity can scale without significantly increasing training or inference time.
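The idea can be illustrated with a minimal sketch in plain Python. Everything named here is hypothetical and chosen for illustration: the class `LanguageSpecificLayer`, the `LANG_TO_GROUP` mapping, the group names, and the choice of a simple linear map as the layer. The sketch also assumes that a single language code selects the parameter subset; how a full translation direction maps to a group is not specified in the text above.

```python
import random


class LanguageSpecificLayer:
    """Linear layer holding one weight matrix per language group (illustrative only)."""

    def __init__(self, dim, lang_to_group, seed=0):
        rng = random.Random(seed)
        self.dim = dim
        self.lang_to_group = dict(lang_to_group)
        # One dim x dim weight matrix per distinct group.
        self.weights = {
            group: [[rng.gauss(0.0, 0.1) for _ in range(dim)] for _ in range(dim)]
            for group in sorted(set(self.lang_to_group.values()))
        }

    def num_parameters(self):
        # Total capacity grows linearly with the number of groups.
        return len(self.weights) * self.dim * self.dim

    def active_parameters(self):
        # Parameters touched by a single forward call: one group's matrix.
        return self.dim * self.dim

    def forward(self, x, lang):
        # Select the weight matrix of the group this language belongs to,
        # then apply an ordinary matrix-vector product.
        w = self.weights[self.lang_to_group[lang]]
        return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in w]


# Hypothetical grouping; a real system would derive groups from vocabulary overlap.
LANG_TO_GROUP = {"es": "romance", "fr": "romance", "de": "germanic", "nl": "germanic"}

layer = LanguageSpecificLayer(dim=4, lang_to_group=LANG_TO_GROUP)
x = [1.0, 0.0, -1.0, 0.5]

y_fr = layer.forward(x, "fr")  # uses the "romance" weights
y_es = layer.forward(x, "es")  # same group, so the same weights
y_de = layer.forward(x, "de")  # uses the "germanic" weights
```

Adding a third group in this sketch would raise `num_parameters()` while leaving `active_parameters()`, and therefore the cost of one forward call, unchanged.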
Updated 2022-06-05
Tags
Science