Concept

Classification Tree

Classification Tree is used to predict qualitative variables instead of quantitative variables. It is very similar to regression tree and use recursive binary splitting to generate the classification tree. But use Gini index G=k=1Kp^mk(1p^mk)G = \sum_{k=1}^{K} \hat{p}_{mk} (1-\hat{p}_{mk}) and entropy as the indicator. D=k=1Kp^mklogp^mkD = -\sum_{k=1}^{K} \hat{p}_{mk} log \hat{p}_{mk}

The small value of purity of node indicated that a node contains observations from almost the same category. Entropy is the same as Gini Index, if the mmth node is pure, the entropy is very small.

1

6

Updated 2020-03-06

Tags

Data Science