Tracking long-running processes in MySQL is not difficult. A process can be:
- SQL query
- Transaction
There are different ways of tracking both types of processes. First, let's look at some of the available methods to identify long-running queries:
- slow query log
- [pt-kill]
- performance schema
- packet inspection
Slow query log - This is the most common method of identifying slow queries. First, check whether it is already enabled:
mysql> SHOW GLOBAL VARIABLES WHERE VARIABLE_NAME LIKE 'slow_query%' OR VARIABLE_NAME LIKE 'long_query_time';
+---------------------+----------------------------------+
| Variable_name       | Value                            |
+---------------------+----------------------------------+
| long_query_time     | 10.000000                        |
| slow_query_log      | OFF                              |
| slow_query_log_file | /data/mysql/log/mariadb-slow.log |
+---------------------+----------------------------------+
3 rows in set (0.01 sec)
In this example, it is not. To enable the slow query log, issue the following commands:
mysql> SET GLOBAL slow_query_log=ON;
Query OK, 0 rows affected (0.00 sec)

mysql> SET GLOBAL long_query_time=1.0;
Query OK, 0 rows affected (0.00 sec)
I took the opportunity to reduce the long_query_time value from 10 seconds to just 1 second as well. The idea is to capture all queries that take at least 1 second. To make those changes persistent, you would have to modify the MySQL configuration file (e.g. /etc/my.cnf or my.ini):
[mysqld]
..
slow_query_log  = ON
long_query_time = 1
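To quickly confirm that the slow query log is now capturing queries without a restart, you can run a statement that deliberately exceeds the 1-second threshold and then inspect the log file. This is just a sanity-check sketch; the log path is the one reported by SHOW GLOBAL VARIABLES above:

mysql> SELECT SLEEP(2);   -- runs for ~2 seconds, so it should be logged

$ tail /data/mysql/log/mariadb-slow.log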
Next, you will want to summarize the slow query log. There are two well-known tools to choose from (sample invocations follow the list):
- mysqldumpslow - the classic tool that has shipped with MySQL from the beginning; the online documentation explains it very well.
- pt-query-digest - this tool has been widely adopted by the community; see the Percona Toolkit documentation to learn more about it.
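As a rough sketch of how each tool is typically invoked (the slow log path is the one reported above, and the sort/top options are just one reasonable choice):

# Top 10 statements from the slow log, sorted by total query time
$ mysqldumpslow -s t -t 10 /data/mysql/log/mariadb-slow.log

# Full report of the slow log, grouped by normalized (fingerprinted) query
$ pt-query-digest /data/mysql/log/mariadb-slow.log > slow-query-report.txt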
If you are still using an older release of MySQL (e.g. 5.0 or earlier), it is not possible to enable slow query logging without restarting the mysqld process. If you cannot restart the MySQL process, you can still track slow queries using the pt-kill utility:
# --busy-time matches queries that have been running for longer than this time
$ pt-kill --host=localhost -udba --ask-pass --busy-time 1 --print --daemonize --log /var/log/slow-query.log
Remember NOT to specify the --kill option, as it would start killing your queries.
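If you want to watch only a particular class of statements, pt-kill can also filter on the query text. A minimal sketch, assuming the same connection options as above (it still only prints, it never kills):

# Print only SELECT statements that have been running for more than 5 seconds
$ pt-kill --host=localhost -udba --ask-pass --busy-time 5 --match-info '(?i)^select' --print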
Performance schema - It was introduced in version 5.5 and has been enabled by default since 5.6. It provides a way to inspect internal server events (such as I/O activity, locks, query profiling, etc.) and can also be used to track timing of long-running processes. Below are a few examples of queries that can be used to track long-running statements, statements that use temporary tables, and/or the ones that perform full table scans:
##
## A high-level overview of statement digests (query analysis), ordered by the
## percentage of executions that did not use an index, then by execution count.
##
SELECT DIGEST,
       DIGEST_TEXT AS query,
       first_seen,
       last_seen,
       COUNT_STAR AS exec_count,
       SUM_NO_INDEX_USED AS no_index_used_count,
       SUM_NO_GOOD_INDEX_USED AS no_good_index_used_count,
       ROUND((SUM_NO_INDEX_USED / COUNT_STAR) * 100) no_index_used_pct
  FROM performance_schema.events_statements_summary_by_digest
 WHERE SUM_NO_INDEX_USED > 0
    OR SUM_NO_GOOD_INDEX_USED > 0
 ORDER BY no_index_used_pct DESC, exec_count DESC
 LIMIT 5\G

##
## List all normalized statements that use temporary tables, ordered by the number of
## on-disk temporary tables descending first, then by the number of in-memory tables.
##
SELECT IF(LENGTH(DIGEST_TEXT) > 64,
          CONCAT(LEFT(DIGEST_TEXT, 30), ' ... ', RIGHT(DIGEST_TEXT, 30)),
          DIGEST_TEXT) AS query,
       COUNT_STAR AS exec_count,
       SUM_CREATED_TMP_TABLES AS memory_tmp_tables,
       SUM_CREATED_TMP_DISK_TABLES AS disk_tmp_tables,
       ROUND(SUM_CREATED_TMP_TABLES / COUNT_STAR) AS avg_tmp_tables_per_query,
       ROUND((SUM_CREATED_TMP_DISK_TABLES / SUM_CREATED_TMP_TABLES) * 100) AS tmp_tables_to_disk_pct,
       DIGEST AS digest
  FROM performance_schema.events_statements_summary_by_digest
 WHERE SUM_CREATED_TMP_TABLES > 0
 ORDER BY SUM_CREATED_TMP_DISK_TABLES DESC, SUM_CREATED_TMP_TABLES DESC
 LIMIT 5;

##
## List all normalized statements that have done a full table scan, ordered by the
## percentage of times a full scan was done, then by the number of times the statement executed.
##
SELECT IF(LENGTH(DIGEST_TEXT) > 64,
          CONCAT(LEFT(DIGEST_TEXT, 30), ' ... ', RIGHT(DIGEST_TEXT, 30)),
          DIGEST_TEXT) AS query,
       COUNT_STAR AS exec_count,
       SUM_NO_INDEX_USED AS no_index_used_count,
       SUM_NO_GOOD_INDEX_USED AS no_good_index_used_count,
       ROUND((SUM_NO_INDEX_USED / COUNT_STAR) * 100) no_index_used_pct,
       DIGEST AS digest
  FROM performance_schema.events_statements_summary_by_digest
 WHERE SUM_NO_INDEX_USED > 0
    OR SUM_NO_GOOD_INDEX_USED > 0
 ORDER BY no_index_used_pct DESC, exec_count DESC
 LIMIT 5;
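The digest tables above aggregate statistics over time. To see statements that are executing right now for longer than a given threshold, a minimal sketch could look like the following (it assumes statement instrumentation is enabled; TIMER_WAIT is measured in picoseconds, and the 1-second threshold is arbitrary):

##
## Statements currently running for more than 1 second
##
SELECT t.PROCESSLIST_ID,
       t.PROCESSLIST_USER,
       esc.SQL_TEXT,
       ROUND(esc.TIMER_WAIT / 1000000000000, 2) AS runtime_seconds
  FROM performance_schema.events_statements_current esc
  JOIN performance_schema.threads t USING (THREAD_ID)
 WHERE esc.TIMER_WAIT > 1 * 1000000000000
 ORDER BY esc.TIMER_WAIT DESC;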
Packet inspection - this can be achieved using the tcpdump utility, but it is not covered in this article due to its known limitations:
- you can only observe queries sent over the network.
- you don’t see queries sent through Unix sockets or via replication.
- you also can’t see queries run from stored procedures.
- security concerns: this method requires root access.
In the next article, I will blog about how to identify long-running transactions...