Disable default paging in list watches #51876

smarterclayton · 2017-09-03T19:06:03Z

For 1.8 this will be off by default. In 1.9 it will be on by default.
Add tests and rename some fields to use the chunking terminology.

Note that the pager may be used for other things besides chunking.

Follow on to #48921, we left the field on to get some exercise in the normal code paths, but needs to be disabled for 1.8.

@liggitt let's merge on wednesday.

smarterclayton · 2017-09-03T19:07:23Z

/approve no-issue

wojtek-t · 2017-09-04T12:41:46Z

/lgtm

fejta-bot · 2017-09-04T14:54:50Z

/retest
This bot automatically retries jobs that failed/flaked on approved PRs (send feedback to @fejta).

Review the full test history for this PR.

dims · 2017-09-04T17:42:46Z

/test all

fejta-bot · 2017-09-04T20:30:50Z

/retest
This bot automatically retries jobs that failed/flaked on approved PRs (send feedback to @fejta).

Review the full test history for this PR.

dims · 2017-09-04T21:35:23Z

Looks like a legit verify failure - FAILED hack/make-rules/../../hack/verify-bazel.sh 20s

fejta-bot · 2017-09-04T23:39:50Z

/retest
This bot automatically retries jobs that failed/flaked on approved PRs (send feedback to @fejta).

Review the full test history for this PR.

smarterclayton · 2017-09-05T00:01:39Z

Still set to DNM because we're gathering data

smarterclayton · 2017-09-05T21:37:47Z

/test pull-kubernetes-kubemark-e2e-gce-big

smarterclayton · 2017-09-06T03:42:28Z

/test pull-kubernetes-kubemark-e2e-gce-big

smarterclayton · 2017-09-06T14:45:21Z

Metrics from the two runs:

Before

I0905 22:34:40.939]   /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/scalability/load.go:103
I0905 22:34:41.038] Sep  5 22:34:41.037: INFO: Top latency metric: {Resource:pods Subresource: Verb:LIST Latency:{Perc50:9.261ms Perc90:11.478ms Perc99:48.501ms Perc100:0s} Count:9840}
I0905 22:34:41.038] Sep  5 22:34:41.037: INFO: Top latency metric: {Resource:services Subresource: Verb:DELETE Latency:{Perc50:12.146ms Perc90:15.141ms Perc99:35.373ms Perc100:0s} Count:821}
I0905 22:34:41.038] Sep  5 22:34:41.037: INFO: Top latency metric: {Resource:nodes Subresource: Verb:LIST Latency:{Perc50:23.318ms Perc90:25.504ms Perc99:27.916ms Perc100:0s} Count:46}
I0905 22:34:41.038] Sep  5 22:34:41.037: INFO: Top latency metric: {Resource:services Subresource: Verb:POST Latency:{Perc50:3.721ms Perc90:4.891ms Perc99:18.715ms Perc100:0s} Count:821}
I0905 22:34:41.039] Sep  5 22:34:41.037: INFO: Top latency metric: {Resource:services Subresource: Verb:LIST Latency:{Perc50:15.635ms Perc90:16.778ms Perc99:16.778ms Perc100:0s} Count:10}
I0905 22:34:41.039] Sep  5 22:34:41.037: INFO: Printing summary: APIResponsiveness

After

I0906 04:26:59.107]   /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/scalability/load.go:103
I0906 04:26:59.211] Sep  6 04:26:59.211: INFO: Top latency metric: {Resource:nodes Subresource: Verb:LIST Latency:{Perc50:24.351ms Perc90:28.475ms Perc99:56.18ms Perc100:0s} Count:48}
I0906 04:26:59.212] Sep  6 04:26:59.211: INFO: Top latency metric: {Resource:pods Subresource: Verb:LIST Latency:{Perc50:10.018ms Perc90:12.068ms Perc99:49.82ms Perc100:0s} Count:9840}
I0906 04:26:59.212] Sep  6 04:26:59.211: INFO: Top latency metric: {Resource:services Subresource: Verb:DELETE Latency:{Perc50:12.294ms Perc90:15.457ms Perc99:37.681ms Perc100:0s} Count:821}
I0906 04:26:59.212] Sep  6 04:26:59.211: INFO: Top latency metric: {Resource:replicationcontrollers Subresource: Verb:LIST Latency:{Perc50:22.8ms Perc90:22.8ms Perc99:22.8ms Perc100:0s} Count:2}
I0906 04:26:59.212] Sep  6 04:26:59.211: INFO: Top latency metric: {Resource:services Subresource: Verb:LIST Latency:{Perc50:15.143ms Perc90:20.35ms Perc99:20.35ms Perc100:0s} Count:10}
I0906 04:26:59.213] Sep  6 04:26:59.211: INFO: Printing summary: APIResponsiveness

wojtek-t · 2017-09-06T14:50:34Z

Slightly higher, but the difference isn't large. I think at that level of latencies (small tens of ms), that may be expected. WDYT?

smarterclayton · 2017-09-06T15:03:25Z

I was expecting to see a drop (since at the apiserver a paged list should be proportionally faster) on at least one of the high N resource types. Pods would be most likely. However, pods are likely dominated by node list watches, not by master list watches, so pod tail latency should go down.

smarterclayton · 2017-09-06T17:03:40Z

What is the max resource collection size on the cluster? I.e. how big do pods get at any one time? 1k? 9k?

smarterclayton · 2017-09-06T17:26:04Z

/test pull-kubernetes-kubemark-e2e-gce-big

smarterclayton · 2017-09-07T02:22:42Z

Ok, so with the second run a number of mutation operations had lower tail latencies. This would be expected when there are conflicting reads and writes (in etcd3 at the moment there are a few range locks that large range reads can take that block writes). However, paging wasn't happening frequently enough in the run to tell one way or another, because in practice this test doesn't ever reLIST - all caches start empty and are fed by watches. So I'm going to say we need a better test scenario for this before we can say one way or another.

Going to drop the last commit and get everything green, then reapply label to disable paging on the client side (as the PR originally mentions).

For 1.8 this will be off by default. In 1.9 it will be on by default. Add tests and rename some fields to use the `chunking` terminology. Note that the pager may be used for other things besides chunking.

wojtek-t · 2017-09-07T07:13:19Z

/lgtm

k8s-github-robot · 2017-09-07T07:13:27Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: smarterclayton, wojtek-t

Associated issue: 48921

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these OWNERS Files:

~~pkg/client/OWNERS~~ [smarterclayton,wojtek-t]
~~pkg/controller/garbagecollector/OWNERS~~ [smarterclayton,wojtek-t]
~~staging/src/k8s.io/client-go/OWNERS~~ [smarterclayton,wojtek-t]

You can indicate your approval by writing /approve in a comment
You can cancel your approval by writing /approve cancel in a comment

smarterclayton · 2017-09-07T19:33:21Z

/retest

lavalamp · 2017-09-07T20:41:33Z

assign @jpbetz

fejta-bot · 2017-09-07T23:24:51Z

/retest
This bot automatically retries jobs that failed/flaked on approved PRs (send feedback to @fejta).

Review the full test history for this PR.

lavalamp · 2017-09-07T23:52:22Z

staging/src/k8s.io/client-go/tools/cache/listwatch.go

+	WatchFunc WatchFunc
+	// DisableChunking requests no chunking for this list watcher. It has no effect in Kubernetes 1.8, but in
+	// 1.9 will allow a controller to opt out of chunking.
+	DisableChunking bool


can this be named in the positive, especially since that would get the default behavior you want?

The plan was in 1.9 to remove the false && below. All of the test suites are already configured to bypass chunking by setting this flag, whereas all clients would still be opt out for beta. I wanted to avoid pointer insanity in internal code as well as bad globals (since this is core library code).

fejta-bot · 2017-09-08T03:15:49Z

/retest
This bot automatically retries jobs that failed/flaked on approved PRs (send feedback to @fejta).

Review the full test history for this PR.

k8s-github-robot · 2017-09-08T05:27:27Z

/test all [submit-queue is verifying that this PR is safe to merge]

k8s-github-robot · 2017-09-08T06:08:16Z

Automatic merge from submit-queue (batch tested with PRs 48552, 51876)

k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Sep 3, 2017

k8s-github-robot assigned jsafrane and justinsb Sep 3, 2017

k8s-github-robot added the do-not-merge/release-note-label-needed Indicates that a PR should not merge because it's missing one of the release note labels. label Sep 3, 2017

smarterclayton added this to the v1.8 milestone Sep 3, 2017

smarterclayton added kind/bug Categorizes issue or PR as related to a bug. sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. labels Sep 3, 2017

k8s-github-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Sep 3, 2017

k8s-ci-robot assigned wojtek-t Sep 4, 2017

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 4, 2017

wojtek-t mentioned this pull request Sep 4, 2017

delete pods API call latencies shot up on large cluster tests #51899

Closed

smarterclayton added the do-not-merge DEPRECATED. Indicates that a PR should not merge. Label can only be manually applied/removed. label Sep 5, 2017

smarterclayton force-pushed the disable_client_paging branch from 4668912 to a528173 Compare September 6, 2017 02:33

k8s-github-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 6, 2017

smarterclayton force-pushed the disable_client_paging branch from a528173 to fe9d140 Compare September 6, 2017 17:25

smarterclayton force-pushed the disable_client_paging branch from fe9d140 to f68b34d Compare September 7, 2017 02:23

Disable default paging in list watches

8b571bb

For 1.8 this will be off by default. In 1.9 it will be on by default. Add tests and rename some fields to use the `chunking` terminology. Note that the pager may be used for other things besides chunking.

smarterclayton force-pushed the disable_client_paging branch from f68b34d to 8b571bb Compare September 7, 2017 03:11

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 7, 2017

smarterclayton removed the do-not-merge DEPRECATED. Indicates that a PR should not merge. Label can only be manually applied/removed. label Sep 7, 2017

lavalamp reviewed Sep 7, 2017

View reviewed changes

k8s-github-robot merged commit eda3db5 into kubernetes:master Sep 8, 2017

pacoxu mentioned this pull request Feb 24, 2021

APIListChunking - feature gate to GA? #96497

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Disable default paging in list watches #51876

Disable default paging in list watches #51876

smarterclayton commented Sep 3, 2017 •

edited

smarterclayton commented Sep 3, 2017

wojtek-t commented Sep 4, 2017

fejta-bot commented Sep 4, 2017

dims commented Sep 4, 2017

fejta-bot commented Sep 4, 2017

dims commented Sep 4, 2017

fejta-bot commented Sep 4, 2017

smarterclayton commented Sep 5, 2017

smarterclayton commented Sep 5, 2017

smarterclayton commented Sep 6, 2017

smarterclayton commented Sep 6, 2017

wojtek-t commented Sep 6, 2017

smarterclayton commented Sep 6, 2017

smarterclayton commented Sep 6, 2017

smarterclayton commented Sep 6, 2017

smarterclayton commented Sep 7, 2017

wojtek-t commented Sep 7, 2017

k8s-github-robot commented Sep 7, 2017

smarterclayton commented Sep 7, 2017

lavalamp commented Sep 7, 2017

fejta-bot commented Sep 7, 2017

lavalamp Sep 7, 2017

smarterclayton Sep 8, 2017 •

edited

fejta-bot commented Sep 8, 2017

k8s-github-robot commented Sep 8, 2017

k8s-github-robot commented Sep 8, 2017

Disable default paging in list watches #51876

Disable default paging in list watches #51876

Conversation

smarterclayton commented Sep 3, 2017 • edited

smarterclayton commented Sep 3, 2017

wojtek-t commented Sep 4, 2017

fejta-bot commented Sep 4, 2017

dims commented Sep 4, 2017

fejta-bot commented Sep 4, 2017

dims commented Sep 4, 2017

fejta-bot commented Sep 4, 2017

smarterclayton commented Sep 5, 2017

smarterclayton commented Sep 5, 2017

smarterclayton commented Sep 6, 2017

smarterclayton commented Sep 6, 2017

wojtek-t commented Sep 6, 2017

smarterclayton commented Sep 6, 2017

smarterclayton commented Sep 6, 2017

smarterclayton commented Sep 6, 2017

smarterclayton commented Sep 7, 2017

wojtek-t commented Sep 7, 2017

k8s-github-robot commented Sep 7, 2017

smarterclayton commented Sep 7, 2017

lavalamp commented Sep 7, 2017

fejta-bot commented Sep 7, 2017

lavalamp Sep 7, 2017

Choose a reason for hiding this comment

smarterclayton Sep 8, 2017 • edited

Choose a reason for hiding this comment

fejta-bot commented Sep 8, 2017

k8s-github-robot commented Sep 8, 2017

k8s-github-robot commented Sep 8, 2017

smarterclayton commented Sep 3, 2017 •

edited

smarterclayton Sep 8, 2017 •

edited