A
resque_jobs_in_queue{instance="resque"}
Size: a a a
A
resque_jobs_in_queue{instance="resque"}
A
AS
MY
MY
MY
IE
VS
Y
Y
A
Y
A
groups:и когда количество минут есть - можно SLO отображать как-нибудь так: sum_over_time(mambu:errrate_less02:up_minutes[1d]) * 100 / (24 * 60)
- name: duration_p01less50
interval: 1m
rules:
# stores metric where success if p01 latency less than 50 ms
- record: mambu:quant01_less50:up
expr: histogram_quantile(0.1, sum by (le) (rate(mambu_request_duration_milliseconds_bucket[5m]))) < bool 50
# stores uptime of 1 minute range where sum of successes is not zero
- record: mambu:quant01less50:up_minutes
expr: clamp_max(sum_over_time(mambu:quant01_less50:up[1m]), 1)
- name: rps_p01less50
interval: 1m
rules:
# stores metric where success if amount of errors less than 20%
- record: mambu:errrate_less02:up
expr: (sum(rate(mambu_request_duration_milliseconds_count{status=~"[45].."}[5m])) / sum(rate(mambu_request_duration_milliseconds_count[5m]))) < bool 0.2
# stores uptime of 1 minute range where sum of successes is not zero
- record: mambu:errrate_less02:up_minutes
expr: clamp_max(sum_over_time(mambu:errrate_less02:up[1m]), 1)
AS
A
A
AS
A
AS