
Skipping MCAD CPU Preemption Test #696

Open
wants to merge 29 commits into base: main

Conversation

Fiona-Waters
Contributor

Skipping the MCAD CPU Preemption Test, which is failing intermittently on PRs, so that we can get some outstanding PRs merged.
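
A minimal sketch of how such a skip might look in a Ginkgo-based e2e test (the exact change in this PR is not quoted in the thread, so the test name and skip message below are illustrative):

It("MCAD CPU Preemption Test", func() {
	// Hypothetical skip: disable the flaky case until the intermittent failure is understood.
	Skip("Skipping MCAD CPU Preemption Test: failing intermittently on PRs")

	// ... original test body would remain below the Skip call ...
})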


openshift-ci bot commented Dec 7, 2023

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign anishasthana for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@ronensc

ronensc commented Dec 7, 2023

For future reference, the root cause analysis of the test's failure has been conducted by @dgrove-oss, and it can be found here:
#691 (comment)

@Fiona-Waters
Contributor Author

For future reference, the root cause analysis of the test's failure has been conducted by @dgrove-oss, and it can be found here: #691 (comment)

Thanks @ronensc, that's good to know!

@dgrove-oss

I don't think it's worth backporting, but I did redo these test cases for mcad v2 to be robust against different cluster sizes in project-codeflare/mcad#83

@Fiona-Waters
Contributor Author

More investigation is required as to why these tests are failing. Closing this PR.

@asm582 removed the request for review from metalcycling on December 17, 2023 at 20:42
Contributor

@KPostOffice left a comment


Looks good. I like the move to more generic tests. One question.

//aw := createDeploymentAWwith550CPU(context, appendRandomString("aw-deployment-2-550cpu"))
cap := getClusterCapacitycontext(context)
resource := cpuDemand(cap, 0.275).String()
aw := createGenericDeploymentCustomPodResourcesWithCPUAW(
Contributor

What happens if the cluster has many smaller nodes, resulting in a high total capacity but an inability to schedule AppWrappers because they do not fit on the individual nodes? Do we care about that at all in this test case?

Member

From a test case perspective, the cluster is assumed to have homogeneous nodes, and the test requests deployments that fit on a node in the cluster in the CPU dimension.
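
A minimal sketch of what a cpuDemand-style helper could look like under that homogeneity assumption (the real implementation is not quoted in this thread; the MilliCPU field and import paths below are assumptions):

import (
	"k8s.io/apimachinery/pkg/api/resource"

	clusterstateapi "github.com/project-codeflare/multi-cluster-app-dispatcher/pkg/controller/clusterstate/api"
)

// cpuDemand returns a CPU quantity equal to the given fraction of the total
// cluster CPU capacity, e.g. cpuDemand(cap, 0.275) requests 27.5% of the cluster.
func cpuDemand(capacity *clusterstateapi.Resource, fraction float64) *resource.Quantity {
	// Assumes clusterstateapi.Resource tracks CPU in millicores via MilliCPU.
	milli := int64(float64(capacity.MilliCPU) * fraction)
	return resource.NewMilliQuantity(milli, resource.DecimalSI)
}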

Contributor Author

@Fiona-Waters left a comment


This looks great, so happy to move forward with this improvement. Just a couple of small comments.

@@ -793,6 +795,36 @@ func createDeploymentAWwith550CPU(context *context, name string) *arbv1.AppWrapp
return appwrapper
}

func getClusterCapacitycontext(context *context) *clusterstateapi.Resource {
capacity := clusterstateapi.EmptyResource()
nodes, _ := context.kubeclient.CoreV1().Nodes().List(context.ctx, metav1.ListOptions{})
Contributor Author

We should handle the error here.
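
A hedged sketch of one way to handle it, assuming Gomega's Expect is available in this test file (as the suggestion further down implies):

nodes, err := context.kubeclient.CoreV1().Nodes().List(context.ctx, metav1.ListOptions{})
// Fail the test immediately if the node list cannot be retrieved, rather than
// silently computing capacity from an empty list.
Expect(err).NotTo(HaveOccurred())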

podList, err := context.kubeclient.CoreV1().Pods("").List(context.ctx, metav1.ListOptions{FieldSelector: labelSelector})
// TODO: when no pods are listed, do we send entire node capacity as available
// this will cause false positive dispatch.
if err != nil {
Contributor Author

Should the error be caught like this instead?

Suggested change
if err != nil {
Expect(err).NotTo(HaveOccurred())
