User Tools

Site Tools


Writing /app/www/public/data/meta/resolution_area/prometheus_resolutions/res-p1101.meta failed
resolution_area:prometheus_resolutions:res-p1101

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
resolution_area:prometheus_resolutions:res-p1101 [2021/06/24 11:22] btobinresolution_area:prometheus_resolutions:res-p1101 [2021/07/05 11:15] (current) 10.91.120.28
Line 1: Line 1:
 +=====AppDown=====
  
 +**Level:**   __Critical__     FIXME
 +
 +
 +**Purpose:** To monitor application status and alert if it is not responsive
 +
 +**Scenario:** <application> on <server> has been down for more than 120s.
 +
 +**Resolution:** Restart the application
 +
 +**Manual Action Steps:** If the application is one of the grails apps then use the start/stop.sh scripts in the <wrap lo>/var/tomcat/$application/bin</wrap> directory. Remember to start SNMP Manager as sudo. Alternatively if the application is run as a service like the spring boot apps then you can restart via <wrap lo>sudo systemctl restart $application</wrap>
 +
 +**Auto Clear:** Will auto clear when app is responsive