grafana-terraform/environments/dev/adibrov/alerts/node/node_instance_down.yaml

name: "DEV ADIBROV - Node Instance Down"
# `up` is 1 while Prometheus can scrape the target and 0 when the scrape fails.
expression: |
  up{job="node_exporter"}
threshold: 1
for: "2m"
condition_type: "lt"
need_reduce: true
reducer_type: "min"
no_data_state: "Alerting"
exec_err_state: "Alerting"
labels:
  service: "system"
  severity: "critical"
  team: "infra"
summary: |
  Host unreachable: {{ $labels.instance }}
description: |
  Node exporter on {{ $labels.instance }} has not responded for more than 2 minutes.
  The host may be unreachable, or node_exporter may have crashed.