Concept

Scaling Model Capacity with Language-Specific Parameters

a layer whose parameters are split by language or language group based on similarity in vocabulary.

Each translation direction only accesses a subset of these parameters, allowing the model capacity to scale without significantly affecting the training and inference time.

0

1

Updated 2022-06-05

Tags

Science