resolution_area:prometheus_resolutions:res-p1704

resolution_area:prometheus_resolutions:res-p1704 [2021/06/21 12:57] – external edit 127.0.0.1
resolution_area:prometheus_resolutions:res-p1704 [2021/07/05 20:08] (current) 10.91.120.28
 +===== MysqlSlaveIoThreadNotRunning =====
  
 +**Level:** __Critical__ FIXME
 +
 +
 +**Purpose:** Alert Errigal engineers to a problem with replication to the secondary database(s). The replica databases serve the reporting applications, so when replication stops their data goes out of date and customer reports become incorrect.
 +
 +**Scenario:** MySQL Slave IO thread has not been running on <server> for 5m. 
 +
 +**Resolution:** Check the replication status on the server with 'show slave status' and confirm that the slave IO thread is actually not running. Check the InnoDB engine logs to try to determine why replication stopped. To restart the IO thread, run 'stop slave' followed by 'start slave', then check whether replication has been restored.
 + 
 +If the replica database still refuses to resume, examine the error columns in 'show slave status' and continue the investigation from there.
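The check described above can be sketched as a small helper that reads one row of 'show slave status' output. This is only an illustration, not part of the alerting setup: the function name `diagnose_io_thread` and the verdict strings are hypothetical, and it assumes the row has already been fetched into a dict keyed by the standard column names `Slave_IO_Running` and `Last_IO_Error`.

```python
def diagnose_io_thread(status):
    """Classify one SHOW SLAVE STATUS row, given as a dict of column name -> value.

    Returns (verdict, detail), where detail is the Last_IO_Error text (may be empty).
    The verdict strings are hypothetical labels chosen for this sketch.
    """
    io_state = status.get("Slave_IO_Running")
    detail = status.get("Last_IO_Error") or ""

    if io_state == "Yes":
        verdict = "running"             # alert may be stale or already cleared
    elif io_state == "Connecting":
        verdict = "connecting"          # thread is up but has not reached the source
    elif detail:
        verdict = "stopped_with_error"  # continue the investigation from the error text
    else:
        verdict = "stopped"             # no recorded error; try stop slave / start slave
    return verdict, detail


# Example row trimmed to the columns used above (values are placeholders).
row = {"Slave_IO_Running": "No", "Last_IO_Error": "example error text"}
print(diagnose_io_thread(row))
```

A "connecting" verdict usually points at network or credential problems rather than at the replica itself, which is why it is kept separate from the stopped cases.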
 +
 +**Manual Action Steps:**
 +<code>
 +- Check errors in 'show slave status'
 +- Restart replication on the replica
 +    - stop slave;
 +    - start slave;
 +</code>
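If the manual steps are ever scripted, they might look like the sketch below. It assumes a DB-API 2.0 cursor already connected to the replica with sufficient privileges. The function name `restart_replication`, the retry count, and the delay are hypothetical choices, not values taken from this runbook.

```python
import time


def restart_replication(cursor, attempts=5, delay_seconds=2.0):
    """Run STOP SLAVE / START SLAVE, then poll SHOW SLAVE STATUS.

    `cursor` is any DB-API 2.0 cursor connected to the replica.
    Returns the last status row as a dict keyed by column name.
    """
    cursor.execute("STOP SLAVE")
    cursor.execute("START SLAVE")

    status = {}
    for _ in range(attempts):
        cursor.execute("SHOW SLAVE STATUS")
        row = cursor.fetchone()
        if row is None:
            raise RuntimeError("SHOW SLAVE STATUS returned no rows; is this a replica?")
        # Map the positional row onto column names from the cursor metadata.
        columns = [col[0] for col in cursor.description]
        status = dict(zip(columns, row))
        if status.get("Slave_IO_Running") == "Yes":
            break
        # The IO thread can report "Connecting" briefly after START SLAVE.
        time.sleep(delay_seconds)
    return status
```

If the returned dict still does not show `Slave_IO_Running` as `Yes`, its `Last_IO_Error` value is the starting point for further investigation.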
 +
 +
 +**Auto Clear:** Yes