Learn Before
BERT-based Architecture for Sequence Labeling
A common architecture for sequence labeling tasks like Named Entity Recognition (NER) uses BERT. The input sequence is first tokenized, and special tokens like [CLS] and [SEP] are added to the beginning and end, respectively. This sequence is then passed through the BERT model to obtain a contextualized hidden state representation for each token. Finally, a classification layer is placed on top of each token's hidden state (excluding the special tokens) to predict its corresponding label from a predefined set, such as {B, I, O} for NER.
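The pipeline above (per-token hidden states feeding a shared classification head, with the special tokens excluded) can be sketched in a few lines. This is an illustrative mock-up with made-up small dimensions and random vectors standing in for BERT's output, not a real pre-trained model; the variable names and the tiny {B, I, O} label set follow the text.

```python
import numpy as np

# Hypothetical sizes for illustration (real BERT-base uses hidden_size=768).
hidden_size = 8
labels = ["B", "I", "O"]  # the {B, I, O} tag set from the text

rng = np.random.default_rng(0)

# Stand-in for BERT output: one contextualized vector per token,
# including [CLS] at position 0 and [SEP] at the end.
tokens = ["[CLS]", "New", "York", "is", "big", "[SEP]"]
hidden_states = rng.normal(size=(len(tokens), hidden_size))

# Token-level classification head: one linear layer shared across positions.
W = rng.normal(size=(hidden_size, len(labels)))
b = np.zeros(len(labels))

logits = hidden_states @ W + b        # shape (seq_len, num_labels)
predictions = logits.argmax(axis=-1)  # predicted label index per token

# Read off labels, excluding the special tokens.
tagged = [(tok, labels[p])
          for tok, p in zip(tokens, predictions)
          if tok not in ("[CLS]", "[SEP]")]
print(tagged)
```

Note that the same weight matrix `W` is applied at every position: the head classifies each token independently given its contextualized vector, which is why the special tokens can simply be masked out when reading predictions.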

Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Part-of-Speech (POS) Tagging
Span Prediction in NLP
Definition of Named Entity Recognition
A model is designed to perform a sequence labeling task by identifying organizations and locations within a text. For each word (token), it must assign one of the following labels:
O (not an entity), B-ORG (beginning of an organization), I-ORG (inside an organization), B-LOC (beginning of a location), or I-LOC (inside a location). Given the sentence 'The United Nations headquarters in New York City is a major landmark', which of the following represents the correct sequence of labels?
Applicability of Sequence Labeling
Analyzing a Sequence Labeling Model's Output
Negative Likelihood Loss in Sequence Labeling
Learn After
Illustration of BERT-based Architecture for Named Entity Recognition
Training BERT-based NER Models
BERT-based Architecture for Span Prediction
An engineer is using a pre-trained transformer model to build a system that assigns a grammatical tag (e.g., Noun, Verb, Adjective) to every word in a sentence. After the model processes the input and generates a final hidden state vector for each token, which of the following is the most appropriate architectural choice to generate the tag for each specific word?
A developer is building a model to assign a specific category (e.g., 'Person', 'Location', 'Organization') to each word in a sentence. The model's architecture involves using a large, pre-trained component to understand the context of each word. Arrange the following steps in the correct chronological order that describes how this model processes an input sentence to generate a label for each word.
An engineer is building a system to identify and tag specific medical terms (e.g., 'symptom', 'disease', 'medication') within clinical notes. They are using a large, pre-trained transformer-based model that processes an entire sentence and outputs a contextualized vector representation for each input token. Which of the following describes the most effective and standard final layer design for this token-level classification task?