New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix Grafana dashboards. #7121
Fix Grafana dashboards. #7121
Conversation
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## master #7121 +/- ##
==========================================
- Coverage 65.04% 64.98% -0.07%
==========================================
Files 221 221
Lines 20799 20799
==========================================
- Hits 13529 13516 -13
- Misses 6133 6143 +10
- Partials 1137 1140 +3 ☔ View full report in Codecov by Sentry. |
Gathering all diffs from dapr/test-infra#204 we got to the following regex: sed -i \
-e 's/\bkubernetes_name\b/service/g' \
-e 's/\bkubernetes_namespace\b/namespace/g' \
-e 's/\bkubernetes_node\b/node/g' \
-e 's/\bkubernetes_pod_name\b/pod/g' \
*.json |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why is the grafana dashboard only limited to longhaul_test
? And why are the service names hardcoded?
Isn't this a general grafana dashboard that can be optionally installed on any dapr cluster?
cc @dapr/maintainers-dapr ?
grafana/grafana-actor-dashboard.json
Outdated
"text": "longhaul-test", | ||
"value": "longhaul-test" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why is this specific to longhaul_test
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It doesn't have to be specific to longhauls, but if it is going to be an arbitrary string, why not have it be something useful?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
While I think it is safe to keep these longhaul settings for the reasons I mentioned before, I added a98f3f1 to remove those specific settings.
It isn't. This PR accomplishes 2 things:
The last item is the contentious one so let me dive on it. While the last item might make you believe that we are limiting it to First because grafana pics up the contents selectable for the variables Second, you might observe that there already are hardcoded value for these 2 these variables:
So that, upon install in a fresh longhaul cluster + grafana install, all apps existing are selected by default. I am just saving us a few clicks.
My point is that it is just as generic as it was before. Upon loading, those variables whose values we are pre-feeding will be gathered by the metric queries associated with them. |
The current grafana dashboards do not work in a fresh cluster where prometheus and grafana are installed using helm following Dapr Docs (see [1], [2]). They refer to metrics that are not available in such install. In short, based on bug-report from dapr/test-infra#204, the proposed fix can be summed by: ```bash sed -i \ -e 's/\bkubernetes_name\b/service/g' \ -e 's/\bkubernetes_namespace\b/namespace/g' \ -e 's/\bkubernetes_node\b/node/g' \ -e 's/\bkubernetes_pod_name\b/pod/g' \ *.json ``` Additionally: * Removes refresh rates smaller than 1 minute. * Sets default interval range to 14 days in the past to now * Sets default template values to match the longhaul clusters. Fixes dapr#7120 [1]: https://docs.dapr.io/operations/observability/metrics/prometheus/#setup-prometheus-on-kubernetes [2]: https://docs.dapr.io/operations/observability/metrics/grafana/#setup-on-kubernetes Signed-off-by: Tiago Alves Macambira <tmacam@burocrata.org>
Signed-off-by: Tiago Alves Macambira <tmacam@burocrata.org>
Description
The current grafana dashboards do not work in a fresh cluster where prometheus and grafana are installed using helm following Dapr Docs (see 1, 2). They refer to metrics that are not available in such install.
In short, based on bug-report from dapr/test-infra#204, the proposed fix can be summed by:
Additionally:
Issue reference
Please reference the issue this PR will close: #7120
Testing done
Imported proposed JSON files to longhaul environments. See dapr/test-infra#167
Checklist
Please make sure you've completed the relevant tasks for this PR, out of the following list: