Concept

Information Gain Ratio

An improvement proposed by Ross Quinlan that corrects for the disadvantage of Information Gain by taking into account the number of values a feature can take. Information Gain Ratio (IGR) is just the ratio of IG and Intrinsic Value (IV) which is defined as :

IV(X,Y)=Σvvalues(Y){XY==v}Xlog2{XY==v}XIV(X, Y) = - \Sigma_{v \in values(Y)} \frac{|\{X_Y == v\}|}{|X|} \log_2 \frac{|\{X_Y == v\}|}{|X|} Thus, Information Gain Ratio is :

IGR(X,Y)=IG(X,Y)IV(X,Y)IGR(X,Y) = \frac{IG(X,Y)}{IV(X,Y)}

0

2

Updated 2020-04-05

Tags

Data Science

Related