User Tools

Site Tools


resolution_area:prometheus_resolutions:res-p1111

Watchdog (#)

Level: Critical :?:

Purpose: Alerts operations that the watchdog agent is not running on a server this can lead to support missing critical alerts.

Scenario: When a Watchdog cannot start it will generate the Watchdog Agent alert. There is an active alarm on the Watchdog Agent.

Resolution: Check if watchdog is running on server, restart if necessary. Logs are available at watchdog/logs

Manual Action Steps: Check for uncleared active alarm on Cerberus server

select * from active_alarm where cleared = false and context like '%Watchdog%'

Manually clear Watchdog Agent alarm.

update snmp_manager.active_alarm set cleared = True where cleared is False and context like '%Watchdog%'

Auto Clear: Will clear when issue has been resolved.

resolution_area/prometheus_resolutions/res-p1111.txt · Last modified: 2021/12/23 14:21 by 10.91.120.28