当前位置:网站首页>Wu Enda machine learning chapter 1-2
Wu Enda machine learning chapter 1-2
2022-07-19 06:42:00 【Watermelon that loves programming】
Wu Enda machine learning 1-2 Chapter after class summary .
1-1 Welcome to 《 machine learning 》 Course
A brief introduction to machine learning , And the application of machine learning in life .
machine learning : Machine learning is near 20 A multidisciplinary interdisciplinary subject that has sprung up over the years , Probability theory 、 statistical 、 Approximation theory 、 Convex analysis 、 Algorithm complexity theory and other disciplines . Constantly improve your performance through learning .
The application of machine learning in life : data mining 、 Applications that cannot be programmed ( Such as unmanned aircraft 、 Handwriting recognition, etc )、 Private customization program ( Such as recommending procedures )
1-2 What is machine learning ?
Now the more formal definition of machine learning is : Computer programs begin with experience E Middle school learning , Solve a task T, Perform a performance measurement P, adopt P Measured in T It's because of experience E And improve . For example, for checkers , Experience E The program plays checkers with itself tens of thousands of times , Mission T Just playing checkers , Performance metrics P Is the probability of winning checkers with a new opponent .
1-3 Supervised learning
In class , Explain what supervised learning is based on an example of house price prediction : We give an algorithm a data set , It contains the correct answer . The purpose of the algorithm is to give more correct answers .
Supervised learning is divided into regression problems and classification problems .
The regression problem is to fit with a curve (x,y), Minimize the error . The classification problem is to predict the output of discrete values , To classify . So we can know , If it is a continuous curve, it is a regression problem , If it is a discrete value, it is a classification problem .
1-4 Unsupervised learning
The data we use in unsupervised learning is different from supervised learning , No labels , We don't know the meaning of every data in the dataset , Unsupervised learning is to classify data sets . This is the clustering algorithm . One example is Google news , Google news is to classify the collected news into various topics
One of the most interesting examples is the cocktail party Algorithm , Two or more people use the same microphone , Through machine learning , Separate their voices .
2-1 Model describes
2-1 In the video , By explaining the example of linear regression house price , To explain the description process of the model in supervised learning . Through the input of eigenvalues , To predict the value of house prices .
2-2 Cost function
Cost function ( Also known as the square error function ) The function of is to figure out how to fit the most likely straight line with our data . In order to make the predicted value after we input closer to the real value , If we have smaller parameters in the prediction function .
h(x) It's our prediction function , And the cost function makes it easy to calculate after derivation , Then I took one in front 1/2.
2-3 Cost function ( One )
This section describes the use of cost functions in simple examples , To feel its role .
We make theta1 by 1,theta0 by 0, Then the following straight line can be fitted , The value of its cost function is 0.
Let's assume again , Make theta1 by 0.5,theta0 by 0
, Then the following straight line can be fitted , And the value of the cost function is approximately equal to 0.58
We make theta1 by 0,theta0 by 0, Then the fitting line is horizontal , And the cost function is approximately equal to 2.3
We can also for theta1 Assign various values , Then the graph of its cost function is as follows . The optimization goal of our learning algorithm is to minimize the value of the cost function . Is shown in the figure below theta1 take 1 When .
2-4 Cost function ( Two )
if theta1 and theta0 Can be changed , Then the graph of the cost function is similar to this bowl graph .
Contour lines of each ellipse , It means that theta1 and theta0 Equal point , By taking all kinds of theta1 and theta0, To better fit the function .
2-5 gradient descent
When the gradient decreases, in order to reduce the cost function J To minimize the , Gradient descent method is a very common method , It is widely used in many fields of machine learning .
The problem is described as follows ,Outline Step for gradient descent .

Cost function and theta0、theta1 The figure is as follows .
Think of the figure as a mountain , Find the fastest way down the mountain .
The following figure shows the definition of gradient descent algorithm . We should do this step repeatedly , Until it converges .α Called learning rate , Used to control gradient descent , How big a step we took .α More details will be explained in the following chapters .theta0 and theta1 You want to update at the same time , This is very important

2-6 Summary of gradient descent knowledge points
This section focuses on gradient descent α And derivative term .
α If it's too small , You need to move a little , It will take a long time to reach the lowest point .
α If it is too big , It is likely to lead to failure of convergence or divergence , As shown in the figure below .

When the convergence point is reached , The next step of the gradient descent will not change theta Value .
2-7 The gradient of linear regression decreases
This section explains the algorithm of linear regression obtained by combining gradient descent and cost function .
This is the gradient descent algorithm and linear regression model learned before .
Expand the derivative term as follows .
The partial derivatives of different parameters are as follows .
The derivative term of each parameter is brought into the gradient descent algorithm ( Be careful theta1 and theta0 Simultaneous updating ).

Through continuous gradient descent algorithm , Adjust the parameters , Finally, get the fitted curve 
边栏推荐
- 简单irules编写 入门级
- 感知智能手机上用户的关注状态
- [antdv: Each record in table should have a unique `key` prop,or set `rowKey` to an unique.....
- Positional change of the eyeball during eye movements: evidence of translational movement
- Experiment 1 simple program design
- 实验二 类与对象定义初始化
- F5 GTM(一):DNS参数
- 2022/07/11 group 5 Ding Shuai's study notes day04
- UDP message structure
- [force buckle] design cycle queue
猜你喜欢

Positional Change of the Eyeball During Eye Movements: Evidence of Translatory Movement眼球运动过程中眼球的位

斑点检测 记录

Es aggregation analysis reports an error: "reason": "text fields are not optimized for operations

吴恩达机器学习第1-2章

机器人缝合手势识别和分类

用头部运动学习无姿态注视偏差

吴恩达机器学习第8-9章

DSL realizes automatic completion query

Learning non posture gaze deviation with head movement

机器学习篇-逻辑回归的分类预测
随机推荐
三维凝视估计,没有明确的个人校准2018
Experiment 1 simple program design
用头部运动学习无姿态注视偏差
Experiment 4 operator overloading and virtual functions
《PyTorch深度学习实践》-B站 刘二大人-day3
Restclient query document
Preparation of blast Library of rust language from scratch (2) -- detailed explanation of Blas matrix format
[antdv: Each record in table should have a unique `key` prop,or set `rowKey` to an unique.....
Busybox specified date modification temporarily does not require clock -w to write to hardware
吴恩达机器学习第3-4章
[force buckle] realize stack with queue
Learning video saliency from human gaze using candidate selection
MySQL MySQL calculates the number of weekends this year (Saturday and Sunday)
F5LTM(一)逻辑关系图
实验一 简单程序设计
感知智能手机上用户的关注状态
一种基于凝视的平板电脑手势控制系统的设计与实现
2022/07/12 learning notes (day05) cycle
山西省第二届网络安全技能大赛(企业组)部分赛题WP(四)
《PyTorch深度学习实践》-B站 刘二大人-day4