Learn Before
Concept

ELMo

ELMo is a context-sensitive word embedding model designed to address the polysemy problem. It utilizes a CNN to transfer words into raw vectors, followed by a bidirectional LSTM to calculate a weighted average of the hidden layers, yielding the final word representations. Notably, when ELMo is applied to downstream natural language processing tasks, it requires crafting task-specific architectures and freezes the parameters of its pretrained model during supervised learning.

0

1

Updated 2026-05-26

Tags

Data Science

D2L

Dive into Deep Learning @ D2L