Improvements to the FFN of a transformer
Several modifications have been proposed to improve the feed-forward network (FFN) of the transformer:
- Changes to the activation function
- Adapting the FFN for Larger Capacity
- Dropping FFN Layers
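
To make the three directions above concrete, here is a minimal sketch in plain Python. It assumes the standard two-layer FFN form, W2 * act(W1 * x + b1) + b2. The names `make_ffn`, `identity`, the dimensions, and the random initialization are hypothetical and chosen only for illustration; they do not come from any particular paper or library. Dropping an FFN layer is modeled here as simply skipping the sublayer, which is an assumption about how that idea is realized.

```python
import math
import random

def relu(v):
    # Activation used in the original transformer FFN.
    return max(0.0, v)

def gelu(v):
    # Tanh approximation of GELU, one commonly used alternative activation.
    return 0.5 * v * (1.0 + math.tanh(math.sqrt(2.0 / math.pi) * (v + 0.044715 * v ** 3)))

def make_ffn(d_model, d_hidden, activation, seed=0):
    """Build a toy position-wise FFN: W2 * act(W1 * x + b1) + b2.

    Weights are random placeholders (illustrative only, not trained).
    """
    rng = random.Random(seed)
    w1 = [[rng.gauss(0.0, 0.1) for _ in range(d_model)] for _ in range(d_hidden)]
    b1 = [0.0] * d_hidden
    w2 = [[rng.gauss(0.0, 0.1) for _ in range(d_hidden)] for _ in range(d_model)]
    b2 = [0.0] * d_model

    def ffn(x):
        # Project up to d_hidden and apply the activation.
        hidden = [activation(sum(w * xi for w, xi in zip(row, x)) + b)
                  for row, b in zip(w1, b1)]
        # Project back down to d_model.
        return [sum(w * h for w, h in zip(row, hidden)) + b
                for row, b in zip(w2, b2)]

    return ffn

def identity(x):
    # Stand-in for a dropped FFN sublayer: the input passes through unchanged.
    return list(x)

x = [1.0, -0.5, 0.25, 2.0]

baseline = make_ffn(d_model=4, d_hidden=16, activation=relu)  # reference FFN
gelu_ffn = make_ffn(d_model=4, d_hidden=16, activation=gelu)  # changed activation
wide_ffn = make_ffn(d_model=4, d_hidden=64, activation=relu)  # larger hidden size
dropped = identity                                            # FFN layer removed

for name, layer in [("baseline", baseline), ("gelu", gelu_ffn),
                    ("wide", wide_ffn), ("dropped", dropped)]:
    print(name, [round(v, 4) for v in layer(x)])
```

All four variants map a `d_model`-sized vector to a `d_model`-sized vector, so in this sketch they are interchangeable inside a transformer block; they differ only in the activation, the hidden width, or whether the FFN computation happens at all.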
Updated 2022-05-26
Tags
Data Science