Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Latency/Response Spike Alerting #593

Open
nzurku opened this issue Oct 20, 2023 · 1 comment
Open

Latency/Response Spike Alerting #593

nzurku opened this issue Oct 20, 2023 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@nzurku
Copy link

nzurku commented Oct 20, 2023

We have found it to be relevant and easiest if Cloudprober could detect a spike in latency or response times by itself for alerting.

While the logic/configurables could be debated, something that came to my mind is this:

Rule: if Latency/Response 25% Greater
Check Range: 5 vs 25, compare 5 most recent checks to the previous 25 checks (excluding the most recent 5)
Then: Trigger alerting for a spike in latency/response, raise a metric of this spike/increase to be 1/yes.

@nzurku nzurku added the enhancement New feature or request label Oct 20, 2023
@manugarg
Copy link
Contributor

I think this is kind of interesting. We could add another condition type, say "spike" that would keep average of latencies over time and fire an alert if latency is more than the average by a certain percentage. It will require careful implementation though, I'll think more about it.

@manugarg manugarg self-assigned this Oct 22, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants