Allow disabling of PersistentVolumeUsageNearFull/PersistentVolumeUsageCritical alerts on workloads that are expected to be fully utilized #9568
Comments
I didn't understand the point.
The idea is to provide an escape hatch for workloads that are expected to take up the entire PVC by design. We would then annotate the PVCs we create in https://github.com/kubevirt/containerized-data-importer to exclude them from triggering the alert.
@akalenyu did you miss Travis's comment here? #9568 (comment) I believe this may alleviate the issue. (You could edit the rules so that they ignore PVCs carrying a label of your choosing.)
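For illustration, here is a rough sketch of what "edit the rules to ignore labelled PVCs" could look like. It only builds a PromQL string. The base expression is a simplified stand-in rather than the rule Rook actually ships, and the metric label name `label_example_com_skip_pv_usage_alerts` and its value are hypothetical. It also assumes kube-state-metrics is configured to expose that PVC label through `kube_persistentvolumeclaim_labels`.

```python
def exclude_labeled_pvcs(base_expr: str, metric_label: str, value: str) -> str:
    """Wrap a PromQL alert expression so PVCs with an opt-out label never match.

    metric_label is the label name as it appears on the
    kube_persistentvolumeclaim_labels series (hypothetical in this sketch).
    """
    selector = f'kube_persistentvolumeclaim_labels{{{metric_label}="{value}"}}'
    # "unless" drops every series on the left that has a match on the right
    # for the same (namespace, persistentvolumeclaim) pair.
    return (
        f"({base_expr})\n"
        f"unless on (namespace, persistentvolumeclaim)\n"
        f"({selector} == 1)"
    )


# Simplified stand-in for a usage alert, NOT the actual Rook rule.
base = "kubelet_volume_stats_used_bytes / kubelet_volume_stats_capacity_bytes > 0.75"
print(exclude_labeled_pvcs(base, "label_example_com_skip_pv_usage_alerts", "true"))
```

The resulting expression would replace the `expr` of the alert in a copy of the rules that you deploy yourself, so it only helps if you control the PrometheusRule objects.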
Sorry, I should have been clearer about this: we can't really edit Ceph rules from our project (containerized-data-importer). We're looking to handle this just by annotating the objects we manage (PVCs). That is why the OpenShift monitoring approach worked for us.
Do I take this to mean that you are not the admin of your Kubernetes cluster? How is Rook being installed in your clusters? Rook will no longer deploy the Ceph Prometheus rules once Travis's PR #9837 is merged. After that, users will have to deploy the rules themselves, either manually or via Helm.
I am not an admin of a particular cluster, no. We don't install Rook ourselves as part of the project; I noticed this alert on one of the clusters I was debugging. #9837 might solve the issue, thank you, but I have a feeling that at some point OCS will decide to deploy this alerting rule automatically, which would bring us back to having this alert fire even though we expect the workloads to be nearly 100% utilized.
OpenShift has a proposal for alert customization that would benefit OCS. Until then, if you don't have control of the PrometheusRule CRs created by OCS/Rook, I'm not sure how you can suppress these.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed in a week if no further activity occurs. Thank you for your contributions.
This issue has been automatically closed due to inactivity. Please re-open if this still requires investigation.
Is this a bug report or feature request?
What should the feature do:
Allow components to add a label to a PVC that prevents the
PersistentVolumeUsageNearFull/PersistentVolumeUsageCritical
alerts from firing. (This is similar to openshift/cluster-monitoring-operator#1493; can/should we use the same key/value pair?)
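To make the requested behaviour concrete, here is a minimal sketch of the decision logic in plain Python. The label key/value and the 0.75 threshold are assumptions made for this example, not names or values taken from Rook or from the OpenShift change.

```python
from dataclasses import dataclass, field

# Hypothetical opt-out label; the real key/value pair is what this issue asks to define.
OPT_OUT_KEY = "example.com/skip-pv-usage-alerts"
OPT_OUT_VALUE = "true"


@dataclass
class PVC:
    name: str
    used_bytes: int
    capacity_bytes: int
    labels: dict = field(default_factory=dict)


def should_alert(pvc: PVC, threshold: float = 0.75) -> bool:
    """Return True if a usage alert should fire for this PVC."""
    # A PVC carrying the opt-out label never triggers the alert.
    if pvc.labels.get(OPT_OUT_KEY) == OPT_OUT_VALUE:
        return False
    return pvc.used_bytes / pvc.capacity_bytes > threshold


# A disk-image PVC sized exactly to its content, with and without the label.
full_by_design = PVC("vm-disk", 10_000, 10_000, {OPT_OUT_KEY: OPT_OUT_VALUE})
unlabelled = PVC("vm-disk-unlabelled", 10_000, 10_000)
print(should_alert(full_by_design), should_alert(unlabelled))
```

The point is that the component creating the PVC (CDI in our case) only has to set a label on an object it already owns. It does not need access to the alerting rules.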
What is use case behind this feature:
The use case is that some workloads (kubevirt/CDI) request
a PV that is, by default, exactly the size of the file (disk image) it holds.
This causes the alerts to fire even though the size of the data
will never grow, so the alert is not useful for these volumes.
Environment:
Clusters running KubeVirt; other use cases where a PVC is full by design may exist.