feat: add postgres/gitea/blackbox alerts and more node alerts
Some checks failed
terraform-dev / validate (push) Successful in 6s
terraform-dev / plan (push) Failing after 11s
terraform-dev / apply (push) Has been skipped

This commit is contained in:
Alexandr
2026-04-03 11:34:08 +03:00
parent 03dfa99400
commit 5af763ebb1
18 changed files with 393 additions and 1 deletions

View File

@ -0,0 +1,19 @@
name: "DEV ADIBROV - Gitea Down"
expression: |
gitea_build_info{job="gitea"}
threshold: 1
for: "2m"
condition_type: "lt"
need_reduce: true
reducer_type: "min"
no_data_state: "Alerting"
exec_err_state: "Alerting"
labels:
service: "gitea"
severity: "critical"
team: "infra"
summary: |
Gitea недоступна на {{ $labels.instance }}
description: |
Метрики Gitea не поступают с {{ $labels.instance }}.
Сервис Gitea либо упал, либо недоступен scrape endpoint.

View File

@ -0,0 +1,19 @@
name: "DEV ADIBROV - Gitea Open Issues Spike"
expression: |
increase(gitea_issues_open{job="gitea"}[1h])
threshold: 50
for: "5m"
condition_type: "gt"
need_reduce: true
reducer_type: "max"
no_data_state: "OK"
exec_err_state: "Error"
labels:
service: "gitea"
severity: "warning"
team: "infra"
summary: |
Всплеск открытых issue в Gitea: +{{ printf "%.0f" $values.B.Value }} за час
description: |
За последний час количество открытых issues в Gitea выросло на {{ printf "%.0f" $values.B.Value }}.
Возможна массовая автоматическая генерация задач или проблема с интеграцией.