Concept

Dropping FFN Layers in Transformers

It has been shown that the feed-forward network (FFN) layer in a transformer block can be removed completely without loss of performance, which reduces the complexity of the model.
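To make the complexity reduction concrete, the following is a minimal sketch that counts the parameters of a single transformer block with and without its FFN. It assumes a standard block layout (four attention projections with biases, a two-layer FFN with biases) and illustrative sizes `d_model = 512` and `d_ff = 2048`. The function names and sizes are assumptions chosen for illustration and do not come from the original entry.

```python
def attention_params(d_model: int) -> int:
    # Q, K, V and output projections: four d_model x d_model matrices, each with a bias.
    return 4 * (d_model * d_model + d_model)


def ffn_params(d_model: int, d_ff: int) -> int:
    # Two linear maps (d_model -> d_ff, then d_ff -> d_model), each with a bias.
    return (d_model * d_ff + d_ff) + (d_ff * d_model + d_model)


def block_params(d_model: int, d_ff: int, use_ffn: bool = True) -> int:
    # Layer-norm parameters are ignored here (assumption: they are negligible).
    total = attention_params(d_model)
    if use_ffn:
        total += ffn_params(d_model, d_ff)
    return total


# Illustrative sizes only (assumed, not taken from the original entry).
d_model, d_ff = 512, 2048
with_ffn = block_params(d_model, d_ff, use_ffn=True)
without_ffn = block_params(d_model, d_ff, use_ffn=False)
saved_fraction = 1 - without_ffn / with_ffn

print(f"with FFN:    {with_ffn:,} parameters")
print(f"without FFN: {without_ffn:,} parameters")
print(f"fraction removed: {saved_fraction:.3f}")
```

Under these assumed sizes the FFN holds roughly two thirds of the block's parameters, so dropping it shrinks the block considerably. The count says nothing about accuracy; whether performance is preserved depends on the model and training setup.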

Updated 2022-05-26

Tags

Data Science