Learn Before
Concept

Categorical Imputation via Dummy Variables

A common imputation heuristic for missing categorical fields is to treat the absence of data itself as a distinct category. For example, if a column contains both valid string categories and NaN entries, it can be transformed into multiple binary dummy variables—where one specific column serves to indicate whether the original value was missing by using 11 for missing and 00 otherwise.

0

1

Updated 2026-05-01

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L