Concept

Training Large Dense Models

This concept covers several strategies for increasing the capacity of a sequence-to-sequence Transformer model in the context of multilingual machine translation.

Updated 2022-06-05

Tags

Science