Learn Before
Concept
ELMo
ELMo is a context-sensitive word embedding model designed to address the polysemy problem. It utilizes a CNN to transfer words into raw vectors, followed by a bidirectional LSTM to calculate a weighted average of the hidden layers, yielding the final word representations. Notably, when ELMo is applied to downstream natural language processing tasks, it requires crafting task-specific architectures and freezes the parameters of its pretrained model during supervised learning.
0
1
Updated 2026-05-26
Contributors are:
Who are from:
Tags
Data Science
D2L
Dive into Deep Learning @ D2L