AppDown: Node Monitor
Level:
- NodeMonitorAccidentallyInActive - Critical
Purpose: Fires when an application deployment for a new version has accidentally disabled the Node Monitor for a customer that is still using it.
Scenario: SNMP Manager is on, but the Node Monitor is not.
Resolution: The Node Monitor is up and running again. This is confirmed by a line in the logs saying: “Completed refreshing ClientNetworkElementPoller”
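The log check above can be scripted. The following is a minimal sketch, assuming the application log is available as lines of text; the function name is hypothetical and only the marker string comes from this page.

```python
from typing import Iterable

# Marker string quoted in the Resolution field above.
MARKER = "Completed refreshing ClientNetworkElementPoller"


def node_monitor_started(log_lines: Iterable[str]) -> bool:
    """Return True if any log line contains the Node Monitor marker."""
    return any(MARKER in line for line in log_lines)
```

For example, `node_monitor_started(open(path))` would scan a log file line by line, where `path` is whatever location the deployment writes its application log to (not specified on this page).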
Manual Action Steps:
- First, ask whether the customer with this alert requires the Node Monitor.
  - If not, remove the alert for this customer.
  - If yes, check the customer's env-configuration file for the field 'SnmpManager > nodeMonitorEnabled'.
    - If the field is missing, add it and set it to 'true', then redeploy.
    - If the field is already there and the Node Monitor still did not start on deployment, then something more serious has happened.
- Second, check whether this alert is firing for any other customers.
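The configuration check in the steps above can be sketched as a small decision helper. This is an illustration only: it assumes the env-configuration file has already been parsed into a nested dictionary (the file format is not stated on this page), and the function name and return strings are hypothetical.

```python
from collections.abc import Mapping
from typing import Any


def node_monitor_action(config: Mapping[str, Any]) -> str:
    """Suggest the next step based on 'SnmpManager > nodeMonitorEnabled'."""
    snmp = config.get("SnmpManager")
    value = snmp.get("nodeMonitorEnabled") if isinstance(snmp, Mapping) else None
    # Accept a boolean True or the string 'true' (assumption: either may appear).
    if value is True or str(value).strip().lower() == "true":
        # Field is set, so a failed start points at something more serious.
        return "investigate"
    # Field is missing or not enabled: set it to 'true', then redeploy.
    return "set-true-and-redeploy"
```

The helper only reads the parsed configuration. Editing the file and redeploying remain manual steps.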
Auto Clear: No. Application must be redeployed.
Note: Node Monitor is due for EOL, but ExteNet is still holding on to it. Someday it will be turned off and this alert will fire. At that point you can remove the mTail metric and the Prometheus alert for this in the prometheus-monitoring-config.
resolution_area/prometheus_resolutions/res-p1117.txt · Last modified: 2021/12/20 15:30 by wflaherty