Concept
Task-specific model design
- PLM produces a sequence of vectors in the contextual representation.
- Then, one or more task-specific layers are added on the top to generate the final output for the target task.
- The choice of the architecture of task-specific layers depends on the nature of the task, e.g., the linguistic structure of text needs to be captured.
- RNNs can capture word orders, CNNs are good at recognizing patterns such as key phrases, attention mechanisms are effective to identify correlated words in the text, Siamese NNs are used for text matching tasks, and GNNs can be a good choice if graph structures of natural language (e.g., parse trees) are useful for the target task.
0
1
Updated 2022-05-29
Tags
Data Science