Base resource requests on actual usage #2540

NimJay · 2024-05-10T23:31:00Z

Background

This fixes Base CPU/Memory Requests on Actual Usage #591.
I looked at a deployment of Online Boutique from 4 days ago. I noted the max (actually used) vCPU and memory values of each microservice — within the full 4 days. (The deployment included the loadgenerator.) Olivier did a similar analysis in 2022.
For each value, I rounded up to the next 10m (for CPU) and the next 10MiB for memory — to add a buffer.

Change Summary

I've updated the resource.requests.cpu and resource.requests.memory values of each default microservices's Deployment.

Testing Procedure

We just have to make sure the staging URL works fine. :)

github-actions · 2024-05-10T23:46:15Z

🚲 PR staged at http://34.42.127.234

github-actions · 2024-05-11T00:02:48Z

🚲 PR staged at http://34.42.127.234

NimJay · 2024-05-13T15:38:06Z

I have tested the new Kubernetes manifests on the regular channel and rapid channel (see delete-me-test-resources-new and delete-me-test-resources-new-rapid below).
The new manifests work.

However, I believe, on GKE autopilot, this only shaved off 0.25 vCPUs because the Managed Service for Prometheus is enabled by default on GKE autopilot and requires a lot (relative to Online Boutique) of resources.

github-actions · 2024-05-14T14:38:14Z

🚲 PR staged at http://34.42.127.234

github-actions · 2024-06-02T17:06:18Z

🚲 PR staged at http://34.42.127.234

mathieu-benoit · 2024-06-02T18:59:44Z

kubernetes-manifests/recommendationservice.yaml

-            cpu: 200m
-            memory: 450Mi
+            cpu: 30m
+            memory: 30Mi


Getting OOMKilled with this on a brand new Kind cluster locally, let's have 70Mi in limits and 50Mi in requests like the other Python app emailservice has?

mathieu-benoit · 2024-06-03T18:02:26Z

FYI: I just deployed this in my own GKE Autopilot cluster, and I'm getting these values, k top pods -n onlineboutique-development:

NAME                                    CPU(cores)   MEMORY(bytes)
adservice-bc9667f7c-cdbrd               254m         128Mi
cartservice-69754cfcc9-pvf88            7m           89Mi
checkoutservice-856c5f7f68-cwnnn        84m          49Mi
currencyservice-66cc88ff55-5mk9l        11m          74Mi
emailservice-7d69bb6774-cvx2b           14m          81Mi
frontend-7fcf78c9bd-nxrz8               4m           50Mi
loadgenerator-76f787d859-nthg6          12m          142Mi
paymentservice-7495f4f8d9-b94rh         4m           66Mi
productcatalogservice-5dffd678c-8b7nn   5m           51Mi
recommendationservice-ff8fcbf9d-cgbjz   6m           91Mi
redis-575569978c-2z5fh                  6m           40Mi
shippingservice-7b6764896f-9zhd5        5m           47Mi

In addition to the comment on the recommendationservice, I'm getting these restarts because of memory limits too restrictive:

adservice-bc9667f7c-cdbrd               1/2     CrashLoopBackOff   115 (72s ago)   5h39m
checkoutservice-856c5f7f68-cwnnn        2/2     Running            1 (7m50s ago)   5h39m
currencyservice-66cc88ff55-5mk9l        2/2     Running            5 (62m ago)     5h39m```

Should we have:

For adservice: 150Mi for memory limits? and 150m for cpu limits?
For checkoutservice: 80Mi for memory limits?
For currencyservice: 90Mi for memory limits?

kubernetes-manifests/recommendationservice.yaml

github-actions · 2024-06-04T14:39:47Z

🚲 PR staged at http://34.42.127.234

Base resource requests on actual usage

4898b69

NimJay marked this pull request as ready for review May 10, 2024 23:47

NimJay requested review from yoshi-approver and a team as code owners May 10, 2024 23:47

Update limits by adding 20 to requests

a5966ce

NimJay marked this pull request as draft May 11, 2024 00:15

NimJay added the do not merge Indicates a pull request not ready for merge, due to either quality or timing. label May 11, 2024

NimJay marked this pull request as ready for review May 11, 2024 00:16

NimJay removed the do not merge Indicates a pull request not ready for merge, due to either quality or timing. label May 11, 2024

Merge branch 'main' into nimjay-resource-requests

83d2c59

Merge branch 'main' into nimjay-resource-requests

3c85dec

mathieu-benoit reviewed Jun 2, 2024

View reviewed changes

mathieu-benoit mentioned this pull request Jun 3, 2024

score-k8s Humanitec-DemoOrg/onlineboutique-demo#25

Merged

mathieu-benoit mentioned this pull request Jun 3, 2024

Fix memory and cpu limits to avoid CrashLoopBackOff Humanitec-DemoOrg/onlineboutique-demo#26

Merged

NimJay commented Jun 4, 2024

View reviewed changes

kubernetes-manifests/recommendationservice.yaml Outdated Show resolved Hide resolved

NimJay commented Jun 4, 2024

View reviewed changes

kubernetes-manifests/recommendationservice.yaml Outdated Show resolved Hide resolved

Update based on Kind testing

531e293

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Base resource requests on actual usage #2540

Base resource requests on actual usage #2540

NimJay commented May 10, 2024 •

edited

github-actions bot commented May 10, 2024

github-actions bot commented May 11, 2024

NimJay commented May 13, 2024

github-actions bot commented May 14, 2024

github-actions bot commented Jun 2, 2024

mathieu-benoit Jun 2, 2024 •

edited

mathieu-benoit commented Jun 3, 2024 •

edited

github-actions bot commented Jun 4, 2024

Base resource requests on actual usage #2540

Are you sure you want to change the base?

Base resource requests on actual usage #2540

Conversation

NimJay commented May 10, 2024 • edited

Background

Change Summary

Testing Procedure

github-actions bot commented May 10, 2024

github-actions bot commented May 11, 2024

NimJay commented May 13, 2024

github-actions bot commented May 14, 2024

github-actions bot commented Jun 2, 2024

mathieu-benoit Jun 2, 2024 • edited

Choose a reason for hiding this comment

mathieu-benoit commented Jun 3, 2024 • edited

github-actions bot commented Jun 4, 2024

NimJay commented May 10, 2024 •

edited

mathieu-benoit Jun 2, 2024 •

edited

mathieu-benoit commented Jun 3, 2024 •

edited