Learn Before
Concept
Categorical Imputation via Dummy Variables
A common imputation heuristic for missing categorical fields is to treat the absence of data itself as a distinct category. For example, if a column contains both valid string categories and NaN entries, it can be transformed into multiple binary dummy variables—where one specific column serves to indicate whether the original value was missing by using for missing and otherwise.
0
1
Updated 2026-05-01
Tags
D2L
Dive into Deep Learning @ D2L