Concept

IOB Encoding for Multiple Entity Types

The standard IOB (Inside, Outside, Beginning) tagging scheme uses the tag set $\{\mathrm{B}, \mathrm{I}, \mathrm{O}\}$ to mark entity boundaries in a token sequence. To handle multiple entity types, the B and I tags are specialized for each class. For example, $\mathrm{B\text{-}ORG}$ marks the first word of an organization's name, $\mathrm{I\text{-}ORG}$ marks each subsequent word inside that name, and $\mathrm{O}$ marks a word that does not belong to any named entity. This specialization expands the tag set to $2N + 1$ tags, where $N$ is the number of distinct entity classes: one B tag and one I tag per class, plus the single shared O tag. With $N = 3$ classes, for instance, there are $2 \times 3 + 1 = 7$ tags.
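The tag-set construction and the tagging rule described above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function names `build_iob_tagset` and `encode_iob`, the extra classes `PER` and `LOC`, and the example sentence are all assumptions introduced here (only `ORG` appears in the text). The sketch also assumes every entity starts with a B tag, as the description above implies.

```python
def build_iob_tagset(entity_types):
    """Return the IOB tags for the given classes: O, plus B-X and I-X per class."""
    tags = ["O"]
    for etype in entity_types:
        tags.append(f"B-{etype}")
        tags.append(f"I-{etype}")
    return tags


def encode_iob(tokens, spans):
    """Tag tokens given entity spans as (start, end_exclusive, type) token indices."""
    tags = ["O"] * len(tokens)  # every token is outside an entity by default
    for start, end, etype in spans:
        tags[start] = f"B-{etype}"  # first token of the entity
        for i in range(start + 1, end):
            tags[i] = f"I-{etype}"  # remaining tokens of the same entity
    return tags


# Hypothetical classes; only ORG is taken from the text above.
entity_types = ["PER", "ORG", "LOC"]
tagset = build_iob_tagset(entity_types)
print(len(tagset))  # 2 * 3 + 1 = 7

# Hypothetical example sentence with hand-labeled spans.
tokens = ["Tim", "Cook", "leads", "Apple", "Inc.", "in", "Cupertino"]
spans = [(0, 2, "PER"), (3, 5, "ORG"), (6, 7, "LOC")]
tags = encode_iob(tokens, spans)
for token, tag in zip(tokens, tags):
    print(f"{token}\t{tag}")
```

Because each class contributes exactly two tags and O is shared, the tag count grows linearly with the number of classes, which is what a token classifier's output layer would need to cover.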

Updated 2026-04-18

Tags

Data Science

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.1 Pre-training - Foundations of Large Language Models