ceph: fix 'CephMonQuorumLost' alert #9068
/assign @leseb

Only the 'Running' mons with result value of '1' should be counted.

Signed-off-by: Arun Kumar Mohan <amohan@redhat.com>
please use `monitoring` as a commit prefix
f5df591 to af44e5c
@@ -90,7 +90,7 @@ spec:
   severity_level: critical
   storage_type: ceph
   expr: |
-    count(kube_pod_status_phase{pod=~"rook-ceph-mon-.*", phase=~"Running|running"}) by (namespace) < 2
+    count(kube_pod_status_phase{pod=~"rook-ceph-mon-.*", phase=~"Running|running"} == 1) by (namespace) < 2
Doesn't the `< 2` already cover the `== 1` case?
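
For context, here is a minimal sketch of what the added `== 1` filter changes. It assumes that kube-state-metrics exports a `kube_pod_status_phase{phase="Running"}` series for every pod, with value 1 when the pod is in that phase and 0 otherwise. The sample data and the helper names (`count_by_namespace`, `alert_fires`) are hypothetical and only imitate the PromQL `count(...) by (namespace) < 2` logic; they are not part of the PR.

```python
# Hypothetical samples imitating kube_pod_status_phase{phase="Running"}:
# one series per mon pod, value 1 if the pod is Running, otherwise 0.
samples = [
    {"namespace": "rook-ceph", "pod": "rook-ceph-mon-a", "value": 1},
    {"namespace": "rook-ceph", "pod": "rook-ceph-mon-b", "value": 0},
    {"namespace": "rook-ceph", "pod": "rook-ceph-mon-c", "value": 0},
]


def count_by_namespace(series):
    """Imitate PromQL count(...) by (namespace): count series, ignore values."""
    counts = {}
    for s in series:
        counts[s["namespace"]] = counts.get(s["namespace"], 0) + 1
    return counts


def alert_fires(counts, threshold=2):
    """Imitate the trailing '< 2' comparison for each namespace."""
    return {ns: n < threshold for ns, n in counts.items()}


# Old expression: every matching series is counted, whatever its value.
old_counts = count_by_namespace(samples)

# New expression: '== 1' drops the series whose value is 0 before counting.
new_counts = count_by_namespace([s for s in samples if s["value"] == 1])

print(old_counts, alert_fires(old_counts))
print(new_counts, alert_fires(new_counts))
```

Under these assumptions the `< 2` comparison applies to the number of series, while `== 1` filters on each series' sample value, so the two conditions are independent: without the filter, three mon pods are counted even though only one is running, and the alert stays silent.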
ceph: fix 'CephMonQuorumLost' alert (backport #9068)
Only the 'Running' mons with result value of '1' should be counted.
Signed-off-by: Arun Kumar Mohan <amohan@redhat.com>
Description of your changes:
Which issue is resolved by this Pull Request:
Resolves #
Checklist:
`make codegen` has been run to update object specifications, if necessary.