Latency/Response Spike Alerting #593

nzurku · 2023-10-20T20:35:40Z

We have found it to be relevant and easiest if Cloudprober could detect a spike in latency or response times by itself for alerting.

While the logic/configurables could be debated, something that came to my mind is this:

Rule: if Latency/Response 25% Greater
Check Range: 5 vs 25, compare 5 most recent checks to the previous 25 checks (excluding the most recent 5)
Then: Trigger alerting for a spike in latency/response, raise a metric of this spike/increase to be 1/yes.

manugarg · 2023-10-22T04:01:08Z

I think this is kind of interesting. We could add another condition type, say "spike" that would keep average of latencies over time and fire an alert if latency is more than the average by a certain percentage. It will require careful implementation though, I'll think more about it.

nzurku added the enhancement New feature or request label Oct 20, 2023

manugarg self-assigned this Oct 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Latency/Response Spike Alerting #593

Latency/Response Spike Alerting #593

nzurku commented Oct 20, 2023 •

edited

manugarg commented Oct 22, 2023

Latency/Response Spike Alerting #593

Latency/Response Spike Alerting #593

Comments

nzurku commented Oct 20, 2023 • edited

manugarg commented Oct 22, 2023

nzurku commented Oct 20, 2023 •

edited