Week 4 Data analysis algorithms-Linear models for regression-Bias-Variance Analysis(Part B)
2022-07-17 23:25:00 【Jinzhou hungry bully】
Catalog
One. The relationship between bias and variance (measuring generalization performance)
Two. The generalization error (Generalization error)
Three. The relationship between the regularization parameter λ and bias and variance
Four. Bias-Variance in Regression (Tutorial)
One. The relationship between bias (Bias) and variance (Variance) (measuring the generalization performance of the algorithm)
1. Definitions
(1) Bias: the error between the model's predicted output and the label (it indicates the accuracy of the model). Bias measures whether we have found the best model, or how close we are to it; the larger the bias, the more likely the model is underfitting.
(2) Variance: the model's sensitivity to small fluctuations in the dataset (it indicates how consistent the models are). As the data changes, the model's predictions spread around the best model's predictions; that spread is the variance. The larger the variance, the more likely the model is overfitting.
2. Relationships
(1) High bias and low variance = underfitting model
(2) Low bias and high variance = overfitting model (a highly complex model)
(3) Low bias and low variance = best-fitting model (the ideal model)
(4) High training accuracy but low test accuracy (out-of-sample accuracy) = high variance = overfitting = the model is too complex
(5) Low training accuracy and low test accuracy (out-of-sample accuracy) = high bias = underfitting


3. How to address large bias (Bias) or large variance (Variance)
(1) Large bias (underfitting):
- Add feature data to improve the fit and avoid underfitting.
- Increase the complexity of the model (e.g. increase the polynomial degree M) to improve the fit.
- Try to obtain more features.
- Try adding polynomial features (similar to the previous point).
- Try reducing the degree of regularization.
(2) Large variance (overfitting):
- Add data, especially large amounts of data; this effectively constrains the model, improves its predictions on unseen data, and helps avoid overfitting.
- Add regularization (Regularisation): minimize a loss function that penalizes w. The smaller w is, the smoother the fitted curve, and the better the model generalizes.
- Try reducing the number of features.
- Reduce the complexity of the model, for example by pruning a decision tree or reducing the number of layers in a neural network.
4. Two machine-learning algorithms that reduce bias and variance (Bagging and Boosting)
(1) Boosting reduces model bias
Method:
- Whereas Bagging can train its K sub-models in parallel, Boosting is an iterative method.
- Each round of Boosting pays more attention to the examples the previous round misclassified, giving those examples greater weight so that the next model can identify them more easily.
(2) Bagging reduces model variance
Method:
- Draw K bootstrap samples (sampling with replacement) and train K sub-models (one model per resample).
- Fuse the K models' results by Voting (classification) or Averaging (regression).
Two. The generalization error (Generalization error)
In machine learning, the metric used to measure a model's accuracy on unseen data is called the generalization error (Generalization error). The generalization error Error(f; D) of a model f on an unknown dataset D is jointly determined by the variance (var), the bias (bias), and the noise (ε). The variance is determined by the stability of the model, the bias by how well the model fits the training set, and the noise is beyond our control. The smaller the generalization error, the better the model.

For a test sample x, let y_D be the label of x in the dataset (noise may make the label differ from the true value), let y be the true value of x, and let f(x; D) be the prediction at x of a model f trained on dataset D. Taking regression as an example:

The expected prediction (the average over all predicted values) is:

\bar{f}(x) = E_D[f(x; D)]

The variance (var) is computed as:

var(x) = E_D[(f(x; D) - \bar{f}(x))^2]

The variance is measured on the test data: it relates the predictions to their own average and does not involve the true value.

The noise (ε), i.e. the error between the true value and the actual label, is computed as:

\varepsilon^2 = E_D[(y_D - y)^2]

the expectation of the squared difference between the label and the true value; the noise term is usually ignored.
The bias (bias) is computed as:

bias^2(x) = (\bar{f}(x) - y)^2

Derivation of the generalization-error formula (assuming the noise has zero expectation, i.e. E_D[y_D - y] = 0):

E(f; D) = E_D[(f(x; D) - y_D)^2]
        = E_D[(f(x; D) - \bar{f}(x) + \bar{f}(x) - y_D)^2]
        = E_D[(f(x; D) - \bar{f}(x))^2] + E_D[(\bar{f}(x) - y_D)^2]
          + 2 E_D[(f(x; D) - \bar{f}(x))(\bar{f}(x) - y_D)]

Because E_D[f(x; D)] = \bar{f}(x) (\bar{f}(x) is a fixed number, not a random quantity) and the label noise does not depend on the trained model, the first cross term is 0. Expanding the remaining expectation:

E_D[(\bar{f}(x) - y_D)^2] = E_D[(\bar{f}(x) - y + y - y_D)^2]
                          = (\bar{f}(x) - y)^2 + E_D[(y - y_D)^2]
                            + 2 E_D[(\bar{f}(x) - y)(y - y_D)]

Since the noise does not depend on f and has zero expectation, the second cross term is also 0. Therefore:

E(f; D) = (\bar{f}(x) - y)^2 + E_D[(f(x; D) - \bar{f}(x))^2] + E_D[(y_D - y)^2]
        = bias^2(x) + var(x) + \varepsilon^2
The bias measures the gap between the expected prediction of the learning algorithm and the true result; it describes the algorithm's ability to fit the data, i.e. how well the trained model matches the training samples. The variance measures the change in learning performance caused by changes in the training set; it describes the effect of perturbations in the data. The noise expresses a lower bound on the generalization error achievable by any learning algorithm; it describes the difficulty of the learning problem itself.
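The decomposition E(f; D) = bias²(x) + var(x) + ε² can be checked numerically with a Monte-Carlo sketch. This is an illustration, not the course's code: the sin(2πx) ground truth, the Gaussian label noise, and the cubic polynomial model are all assumptions. At a fixed test point, we train the model on many fresh datasets D, then compare the measured expected squared error against the sum of the three terms.

```python
import numpy as np

rng = np.random.default_rng(1)
noise_std = 0.2
x0 = 0.5                                  # a fixed test point x
y_true = np.sin(2 * np.pi * x0)           # true value y at x

preds, sq_errors = [], []
for _ in range(4000):
    # Draw a fresh training set D with noisy labels y_D.
    x = rng.uniform(0.0, 1.0, 25)
    y_d = np.sin(2 * np.pi * x) + rng.normal(0.0, noise_std, 25)
    coeffs = np.polyfit(x, y_d, 3)        # model f trained on D (cubic, assumed)
    f_x = np.polyval(coeffs, x0)          # prediction f(x; D)
    y_label = y_true + rng.normal(0.0, noise_std)  # noisy test label y_D
    preds.append(f_x)
    sq_errors.append((f_x - y_label) ** 2)

preds = np.array(preds)
f_bar = preds.mean()                      # \bar{f}(x) = E_D[f(x; D)]
bias_sq = (f_bar - y_true) ** 2           # bias^2(x)
var = preds.var()                         # var(x)
eps_sq = noise_std ** 2                   # \varepsilon^2
total = float(np.mean(sq_errors))         # measured E_D[(f(x; D) - y_D)^2]

print(f"bias^2 + var + eps^2 = {bias_sq + var + eps_sq:.4f}")
print(f"measured error       = {total:.4f}")
```

Up to Monte-Carlo error, the two printed numbers agree, because the two cross terms in the derivation vanish in expectation.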
The generalization-error curves relate as follows:
[Figure: bias, variance, and generalization error versus model complexity]
Three. The relationship between the regularization parameter λ and bias and variance
1. When λ takes a smaller value, the complexity of the model increases: bias becomes smaller and variance becomes larger.
2. When λ becomes larger, the complexity of the model decreases and the model becomes simpler: bias becomes larger and variance becomes smaller. The test error follows a trend similar to the generalization error.
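The effect of λ can be seen directly in a regularized least-squares fit. The section above does not specify a model, so the following sketch assumes L2-regularized (ridge) polynomial regression with the closed-form solution w = (XᵀX + λI)⁻¹Xᵀy; the sin(2πx) data and the degree-9 features are likewise illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(2)

def design(x, degree=9):
    """Polynomial feature matrix [1, x, x^2, ..., x^degree]."""
    return np.vander(x, degree + 1, increasing=True)

def ridge_fit(x, y, lam, degree=9):
    """Closed-form regularized least squares: w = (X^T X + lam * I)^(-1) X^T y."""
    X = design(x, degree)
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)

x = rng.uniform(0.0, 1.0, 20)
y = np.sin(2 * np.pi * x) + rng.normal(0.0, 0.2, 20)

w_small_lam = ridge_fit(x, y, lam=1e-8)  # small lambda: complex, wiggly fit
w_large_lam = ridge_fit(x, y, lam=1.0)   # large lambda: simple, smooth fit

print(f"||w|| with small lambda: {np.linalg.norm(w_small_lam):.2f}")
print(f"||w|| with large lambda: {np.linalg.norm(w_large_lam):.2f}")
```

A larger λ shrinks the weight vector w, and smaller weights give a smoother curve: the model is effectively simpler, so bias rises while variance falls, matching points 1 and 2 above.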

Four. Bias-Variance in Regression (Tutorial)
1. The relationship between polynomial model complexity M (polynomial degree) and underfitting/overfitting
In a polynomial model, the smaller M is, the simpler the model and the more likely it is to underfit (high bias, low variance); for example, Figure 1 underfits. The larger M is, the more complex the model and the more likely it is to overfit (high variance, low bias); for example, Figure 10 overfits, while Figure 5 is the best fit (M = 5):

[Figure 1: underfitting]
[Figure 5: best fit (M = 5)]
[Figure 10: overfitting]
2. The test error follows a trend similar to the generalization error; both are jointly determined by bias and variance, as shown in the diagram below. As model complexity grows, the bias gradually decreases while the variance gradually increases, so the generalization error first falls and then rises.
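This trend can be reproduced with a small degree sweep. The sketch below is illustrative rather than the tutorial's code: it assumes sin(2πx) data with Gaussian noise, a small training set, and a large held-out test set, and fits unregularized polynomials of degree M = 1, 5, 10 (matching the three figures above).

```python
import warnings
import numpy as np

rng = np.random.default_rng(3)

def make_data(n, noise=0.25):
    """Assumed data source: y = sin(2*pi*x) + Gaussian noise."""
    x = rng.uniform(0.0, 1.0, n)
    return x, np.sin(2 * np.pi * x) + rng.normal(0.0, noise, n)

def rmse(pred, target):
    return float(np.sqrt(np.mean((pred - target) ** 2)))

x_train, y_train = make_data(15)   # small training set
x_test, y_test = make_data(200)    # large held-out test set

train_err, test_err = {}, {}
with warnings.catch_warnings():
    warnings.simplefilter("ignore")  # high-degree polyfit may warn
    for M in (1, 5, 10):
        coeffs = np.polyfit(x_train, y_train, M)
        train_err[M] = rmse(np.polyval(coeffs, x_train), y_train)
        test_err[M] = rmse(np.polyval(coeffs, x_test), y_test)

for M in (1, 5, 10):
    print(f"M={M:2d}  train RMSE={train_err[M]:.3f}  test RMSE={test_err[M]:.3f}")
```

Training error keeps falling as M grows, but test error behaves like the generalization error: M = 1 underfits (high bias), M = 10 fits the training set almost perfectly yet generalizes poorly (high variance), and the intermediate degree does best.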
