Learn Before
Reference
Attention is all you need; Attentional Neural Network Models | Łukasz Kaiser | Masterclass
A video masterclass by Łukasz Kaiser explaining the Transformer model: https://www.youtube.com/watch?v=rBCqOTEfxvg
Updated 2020-10-24
Tags
Data Science
Related
Neural Machine Translation by Jointly Learning to Align and Translate
Effective Approaches to Attention-based Neural Machine Translation
Attention Motivation
Example of how Attention is used in Machine Translation
The Illustrated Transformer
Attention Is All You Need
Tensor2Tensor Intro
Transformer model
Transformer
Efficient Transformers: A Survey
Evaluation of Efficient Transformers