Concept

Gini index in information Theory

Gini index measures the node purity - total variance across all the K classes. A small value of Gini index implies that one class dominates the region.

G=k=1Kp^mk(1p^mk)G=\sum_{k=1}^{K} \hat{p}_{m k}\left(1-\hat{p}_{m k}\right)

where pmkp_{mk} denotes the proportion of training observations in the mmth region that are from the kkth class. The index measures total variance across the KK classes. The index is minimized when pmkp_{mk} is close to zero or 1.

0

3

Updated 2020-03-06

Tags

Data Science