Variance of Supervised Models in Statistical Learning
Variance refers to the sensitivity of a model () to different training data sets. In other words, it is the difference in the fits of between different training data sets. When we use data sets to fit our statistical learning method, each one will produce a different ; we want an that varies very little between them. High variance means is often more flexible and captures the points of that particular data set well, but changing points for a different training data set will result in large changes in due to overfitting. Lower variance results in less flexibility, but does not change as much between different training data sets.
0
3
Contributors are:
Who are from:
Tags
Data Science
Related
Overfitting a supervised statistical model
Training Error and Test Error
Generalizability of a supervised statistical model
Underfitting a supervised statistical model
Measuring Model Complexity: Rademacher complexity
Bias of Supervised Models in Statistical Learning
Variance of Supervised Models in Statistical Learning
Falsifiability of Machine Learning Models
Notions of Model Complexity
Relationship Between Dataset Size and Model Complexity