User Tools
resolution_area:prometheus_resolutions:res-p1104
AlarmCacheQueueIncreasing
Level: Critical
Purpose: Identify when the RabbitMQ alarm_cache_inbound_queue is increasing.
Scenario: There have been >= 5000 messages in the alarm cache queue for 120s. This means the RabbitMQ is not processing these messages fast enough.
Resolution: Reduce alarm cache queue.
Manual Action Steps: See http://wiki.err/doku.php?id=development:applications:alarmcache:troubleshooting
Auto Clear: Alarm will clear when there is less than 5000 messages in the alarm cache queue
resolution_area/prometheus_resolutions/res-p1104.txt · Last modified: 2021/07/05 12:56 by 10.91.120.28