当前位置：网站首页>The charm of SQL optimization! From 30248s to 0.001s

The charm of SQL optimization! From 30248s to 0.001s

2022-07-26 10:04:00 【Java confidant_】

Click on the official account , Practical technical articles Know in time

scene

The database I use is mysql5.6, Here is a brief introduction to the scene

The curriculum ：

create table Course(  
c_id int PRIMARY KEY,  
name varchar(10)  
)

data 100 strip

Student list :

create table Student(  
id int PRIMARY KEY,  
name varchar(10)  
)

data 70000 strip

Student transcript SC：

CREATE table SC(  
    sc_id int PRIMARY KEY,  
    s_id int,  
    c_id int,  
    score int  
)

data 70w strip

Purpose of inquiry ：

Search for Chinese test 100 Divided candidates

Query statement ：

select s.* from Student s where s.s_id in   

(select s_id from SC sc where sc.c_id = 0 and sc.score = 100 )

execution time ：30248.271s

dizzy , Why is it so slow , Let's check the query plan first ：

EXPLAIN   
  
select s.* from Student s where s.s_id in   

(select s_id from SC sc where sc.c_id = 0 and sc.score = 100 )

picture

Found no index used ,type Is full of ALL, So the first thing to think about is to build an index , The fields that are indexed are, of course where The field of the condition .

First give sc Tabular c_id and score Build an index

CREATE index sc_c_id_index on SC(c_id);  

CREATE index sc_score_index on SC(score);

Execute the above query statement again , Time is : 1.054s

fast 3w Many times , Greatly shorten the query time , It seems that index can greatly improve query efficiency , It is necessary to build an index , Many times I forget to index , When the amount of data is small, I don't feel at all , The optimization feels great .

however 1s It's still too long , Can it be optimized , Take a close look at the execution plan ：

picture

Check out the optimized sql:

SELECT  
    `YSB`.`s`.`s_id` AS `s_id`,  
    `YSB`.`s`.`name` AS `name`  
FROM  
    `YSB`.`Student` `s`  
WHERE  
    < in_optimizer > (  
        `YSB`.`s`.`s_id` ,< EXISTS > (  
            SELECT  
            FROM  
                `YSB`.`SC` `sc`  
            WHERE  
                (  
                    (`YSB`.`sc`.`c_id` = 0)  
                    AND (`YSB`.`sc`.`score` = 100)  
                    AND (  
                        < CACHE > (`YSB`.`s`.`s_id`) = `YSB`.`sc`.`s_id`  
                    )  
                )  
        )  
    )

How to view optimized statements ？

The method is as follows （ Execute... In the command window ）：

picture

Yes type=all

As I thought before , The sql The order of execution should be to execute the subquery first

select s_id from SC sc where sc.c_id = 0 and sc.score = 100

Time consuming ：0.001s

The results are as follows ：

picture

And then execute

select s.* from Student s where s.s_id in(7,29,5000)

Time consuming ：0.001s

That's pretty fast ,Mysql Actually, it's not to execute the inner query first , It's going to be sql Optimization becomes exists Clause , And there comes EPENDENT SUBQUERY,mysql First, execute the outer query , Execute the inner query again , So it's going to cycle 70007*8 Time .

So use connection query instead ？

SELECT s.* from   
Student s  
INNER JOIN SC sc  
on sc.s_id = s.s_id  
where sc.c_id=0 and sc.score=100

In order to re analyze the connection query , Delete index temporarily first sc_c_id_index,sc_score_index

The execution time is ：0.057s

Efficiency has improved , Look at the implementation plan ：

picture

There's a situation of connected watches here , I wonder if it's for sc Tabular s_id Build an index

CREATE index sc_s_id_index on SC(s_id);  
show index from SC

picture

Then execute the connection query

Time : 1.076s

It's even longer , What's the cause of the ？ View execution plan ：

picture

The optimized query statement is ：

SELECT  
    `YSB`.`s`.`s_id` AS `s_id`,  
    `YSB`.`s`.`name` AS `name`  
FROM  
    `YSB`.`Student` `s`  
JOIN `YSB`.`SC` `sc`  
WHERE  
    (  
        (  
            `YSB`.`sc`.`s_id` = `YSB`.`s`.`s_id`  
        )  
        AND (`YSB`.`sc`.`score` = 100)  
        AND (`YSB`.`sc`.`c_id` = 0)  
    )

It seems to be the connection query done first , Further where filter

Go back to the execution plan ：

picture

This is the first thing to do where filter , And make a watch , The execution plan is not fixed , So let's look at the standard sql Execution order ：

picture

Normally it's first join Proceed again where Filter , But our situation here , If first join, There will be a 70w Data sent join, So first execute where Filtration is a wise solution , Now in order to exclude mysql Query optimization , I wrote an optimized sql

SELECT  
    s.*  
FROM  
    (  
        SELECT  
            *  
        FROM  
            SC sc  
        WHERE  
            sc.c_id = 0  
        AND sc.score = 100  
    ) t  
INNER JOIN Student s ON t.s_id = s.s_id

I.e. execute first sc Filtration of the watch , Then connect the meter

The execution time is ：0.054s

And not built before s_id Index time is almost

View execution plan ：

picture

Extract first sc Connect the watch again , So it's much more efficient , The problem now is to extract sc And then there's the scan table , Now it's clear that there's a need for indexes

CREATE index sc_c_id_index on SC(c_id);  
CREATE index sc_score_index on SC(score);

Execute query again ：

SELECT  
    s.*  
FROM  
    (  
        SELECT  
            *  
        FROM  
            SC sc  
        WHERE  
            sc.c_id = 0  
        AND sc.score = 100  
    ) t  

INNER JOIN Student s ON t.s_id = s.s_id

The execution time is ：0.001s

This time is quite reliable , fast 50 times

Implementation plan ：

picture

We'll see , Extract first sc, Connect the watch again , They all use indexes .

Then let's do it again sql:

SELECT s.* from   
Student s  
INNER JOIN SC sc  
on sc.s_id = s.s_id  

where sc.c_id=0 and sc.score=100

execution time 0.001s

Implementation plan ：

picture

Here is mysql Optimized the query statement , First executed. where Filter , Then perform the connection operation , And they all use indexes .

Adjust the content to SC The table's data grows to 300W, Student scores are more discrete .

Look back at ：

show index from SC

picture

perform sql

SELECT s.* from   
Student s  
INNER JOIN SC sc  
on sc.s_id = s.s_id  

where sc.c_id=81 and sc.score=84

execution time ：0.061s

This time is a little slow

Implementation plan ：

picture

It's used here intersect Union operation , That is to say, the results of two indexes retrieval at the same time are combined , Look at the fields score and c_id The differentiation of , From a single field , The degree of differentiation is not very big , from SC Table to retrieve ,c_id=81 The result of the search is 70001,score=84 The result is 39425.

and c_id=81 and score=84 The result is 897, That is to say, the discrimination degree of these two fields is relatively high , Therefore, it will be more efficient to establish a joint index query , From another point of view , The data in this table is 300w, There will be more , In terms of index storage , It's not a small number , As the amount of data increases , Indexes can't all be loaded into memory , Instead, read from the disk , The more indexes there are , The more expensive it is to read the disk , Therefore, it is necessary to establish a multi column joint index according to the specific business situation , So let's try .

alter table SC drop index sc_c_id_index;  
alter table SC drop index sc_score_index;  
create index sc_c_id_score_index on SC(c_id,score);

Execute the above query statement

The time consumed is ：0.007s

This speed is acceptable

Implementation plan ：

picture

The optimization of this statement is now over

summary ：

mysql Nested sub query efficiency is really low
It can be optimized to join queries
When connecting tables , You can use first where Conditions filter the table , Then make a watch connection
（ although mysql Will optimize the join table statement ）
Build the right index , Set up multi column joint index if necessary
Learn to analyze sql Implementation plan ,mysql Would be right sql To optimize , So it's important to analyze the execution plan

Index optimization

The optimization of subquery is mentioned above , And how to index , And when multiple fields are indexed , The fields are individually indexed

Later, we found that it is more efficient to establish a federated index , Especially in the large amount of data , When the differentiation of single column is not high .

Single index

The query statement is as follows ：

select * from user_test_copy where sex = 2 and type = 2 and age = 10

Indexes ：

CREATE index user_test_index_sex on user_test_copy(sex);  
CREATE index user_test_index_type on user_test_copy(type);  
CREATE index user_test_index_age on user_test_copy(age);

Respectively for sex,type,age The fields are indexed , The amount of data is 300w

Query time ：0.415s

Implementation plan ：

picture

Find out type=index_merge

This is a mysql Optimization of multiple single column indexes , Use... For the result set intersect Union operation

Multi column index

We can be here 3 Multiple column indexes on Columns , Will table copy One for testing

create index user_test_index_sex_type_age on user_test(sex,type,age);

Query statement ：

select * from user_test where sex = 2 and type = 2 and age = 10

execution time ：0.032s

fast 10 Many times , And the more distinguishable the multi column index is , The more speed you increase

Implementation plan ：

picture

Left most prefix

Multi column indexes also have the leftmost prefix feature ：

Index will be used , The first field of the index sex To appear in where In the condition

Execute a statement ：

select * from user_test where sex = 2  
select * from user_test where sex = 2 and type = 2  
select * from user_test where sex = 2 and age = 10

Index overlay

That is, all the columns of the query are indexed , In this way, you don't need to go to disk to get the data of other columns when you get the result set , Return index data directly

Such as ：

select sex,type,age from user_test where sex = 2 and type = 2 and age = 10

execution time ：0.003s

It's much faster than taking all the fields

Sort

select * from user_test where sex = 2 and type = 2 ORDER BY user_name

Time ：0.139s

Building an index on the sorting field will improve the efficiency of sorting

create index user_name_index on user_test(user_name)

Finally, attach some sql Tuning summary , There will be time for further study

Try to define the column type as a numeric type , And the length should be as short as possible , Such as primary key and foreign key , Type fields and so on
Create a single column index
Set up multi column joint index as needed

When a single column is filtered, there is a lot of data , So the efficiency of index will be lower , That is, the distinction between columns is low , So if you index multiple columns , So many columns are more distinguishable , There will be a significant increase in efficiency .

Build coverage index according to business scenario

Query only the fields required by the business , If these fields are overwritten by the index , Will greatly improve the query efficiency

Multiple table join fields need to be indexed

This can greatly improve the efficiency of table connection

where You need to index the condition fields
The sort field needs to be indexed
The group field needs to be indexed
Where Conditionally, don't use operands , To avoid index failure

Reference article

http://www.cnblogs.com/linfangshuhellowored/p/4430293.html
http://tech.meituan.com/mysql-index.html
http://www.cnblogs.com/Toolo/p/3634563.html
http://www.cnblogs.com/mliang/p/3637937.html
http://www.cnblogs.com/xwdreamer/archive/2012/07/19/2599494.html
http://www.cnblogs.com/ggjucheng/archive/2012/11/11/2765237.htm

source ：cnblogs.com/tangyanbo/p/4462734.html

recommend

Java Interview questions

Technical involution group , Learn together ！！

PS： Because the official account platform changed the push rules. , If you don't want to miss the content , Remember to click after reading “ Looking at ”, Add one “ Star standard ”, In this way, each new article push will appear in your subscription list for the first time . spot “ Looking at ” Support us ！