Internal Errigal Collaboration Wiki

This is an old revision of the document!

ElastAlertNoScrape

Level: Critical

Purpose: Notify Errigal Operations staff that the elastalert application on prometheus prod has not completed a scrape on the DF-80 (80% disk full) or DF-60 (60% disk full) Elast Alert rules in the last 2 minutes.

Resolution: Elastalert runs in a docker container on prometheus prod. It is monitoring NUCS Disk space in KLAs network by checking elastic search data that has pulled the Operating system stats. If these dont catch log files filling up the HDD the NUC will go offline.

Manual Action Steps:

sudo docker logs --tail 100 elastalert --follow

sudo docker restart elastalert

Observe logs and be sure that the logs show elastalert running the query for both DF-80 and DF-60. These two lines should be present. A restart should do the trick if there was an error and this caused the application to disable the rules.

INFO:elastalert:Queried rule df-80 from 2024-08-06 19:41 UTC to 2024-08-06 19:52 UTC: 0 / 0 hits

INFO:elastalert:Queried rule df-60 from 2024-08-06 19:41 UTC to 2024-08-06 19:53 UTC: 0 / 0 hits

Another thing to check is the rule yaml files

cd /home/scotty/elastalert_config/rules

sudo vi df-60.yaml or sudo vi df-80.yaml

Ensure the setting is_enabled is set to true.

Auto Clear: Will auto clear if the elastalert_scrapes_total counter increases again

Sidebar

Internal Errigal Collaboration Wiki

User Tools

Site Tools

ElastAlertNoScrape

Page Tools