Learn Before
Concept
SEQ2MIXUP
It generalizes MIXUP for sequence transduction tasks in two ways - the "hard" version samples a binary mask and picks from one of two sequences at each token position, while the "soft" version softly interpolates between sequences based on a coefficient sampled from a Bernoulli prior. The "soft" version is found to outperform the "hard" version and earlier interpolation-based techniques.
0
1
Updated 2022-05-20
Tags
Data Science