The expected test MSE for a given value, , can always be decomposed into the sum of 3 quantities:
- the variance of ,
- the squared bias of ,
- the variance of the error
Or:
To minimize test MSE (the left of the equation), we want a technique that has both low bias and low variance.
In this parlance, variance refers to the amount by which would change if estimated with different data. More flexible methods have higher variance
Bias refers to the error associated with using a model that is too simple. More flexible methods have lower bias.
As a general rule, when we use flexible methods, variance will increase and bias will decrease. And when we use restrictive methods, bias will increase and variance will decrease. The goal of modeling is to find a method/model (for your problem/data set) that has low bias and low variance.