
add 100-node higher QPS limit scheduler job #20023

Merged
merged 2 commits into from
Dec 21, 2020

Conversation

adtac
Member

@adtac adtac commented Nov 23, 2020

/hold until kubernetes/kubernetes#96813 is merged

sig/scheduling has made numerous improvements to the scheduler over the past few releases, but these improvements have not been realised in practice due to the scheduler's QPS limits. With API Priority and Fairness graduating to beta in 1.20 (kubernetes/kubernetes#96527), we now have a path towards safely removing the scheduler's rate limiting in 1.21+.

Of course, this would need to be done slowly. @ahg-g and I discussed this offline and we believe that increasing the scheduler's QPS limit (not removing it) in 1.21 is the first step. Once things are observed to be stable over a couple of releases, the client-side rate limiting will be removed entirely and kube-scheduler will depend on APF to make sure it doesn't overload the API server.
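For reference, the scheduler's client-side limits live in its component config (`clientConnection.qps` and `clientConnection.burst` in `KubeSchedulerConfiguration`). A minimal sketch with illustrative values, not the exact numbers used by this job:

```yaml
apiVersion: kubescheduler.config.k8s.io/v1beta1
kind: KubeSchedulerConfiguration
clientConnection:
  # Illustrative values only; the job's actual QPS/burst settings
  # are defined in the test-infra job config, not here.
  qps: 200
  burst: 200
```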

While I have done some preliminary benchmarking of APF with higher scheduler QPS limits (not public yet, will share with everyone in a short while), there would be more confidence in this approach if there were public, periodic benchmarks that run on test-infra. The results could be displayed in perf-dash. These benchmarks should run with the higher QPS limit, and the job should be carefully observed for signs of instability. The job would be temporary and is only expected to run for 3-4 releases; once the scheduler's client-side QPS limits are removed, the regular job (ci-kubernetes-kubemark-100-gce) would be sufficient on its own.

A separate job is needed because all of the changes happen during cluster creation; as a result, existing jobs cannot be reused for this.

/sig scheduling
cc @ahg-g

@k8s-ci-robot k8s-ci-robot added do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. sig/scheduling Categorizes an issue or PR as relevant to SIG Scheduling. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. area/config Issues or PRs related to code in /config area/jobs sig/scalability Categorizes an issue or PR as relevant to SIG Scalability. sig/testing Categorizes an issue or PR as relevant to SIG Testing. labels Nov 23, 2020
@adtac adtac force-pushed the scheduler-200qps branch 2 times, most recently from 920b451 to ebe13eb Compare December 1, 2020 14:33
Signed-off-by: Adhityaa Chandrasekar <adtac@google.com>
Signed-off-by: Adhityaa Chandrasekar <adtac@google.com>
@adtac
Member Author

adtac commented Dec 7, 2020

@wojtek-t updated this PR to use --env=CONTROLLER_MANAGER_TEST_ARGS and --env=SCHEDULER_TEST_ARGS (ref: kubernetes/kubernetes#96813 (comment))
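For illustration, those kubetest flags pass environment variables through to the kube-up scripts, which append the given flags to the component command lines. A hedged sketch of how they might appear in the job's args (the flag values here are hypothetical, not copied from this PR):

```yaml
args:
# Hypothetical values for illustration; see the PR diff for the real ones.
- --env=SCHEDULER_TEST_ARGS=--kube-api-qps=200 --kube-api-burst=200
- --env=CONTROLLER_MANAGER_TEST_ARGS=--kube-api-qps=200 --kube-api-burst=200
```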

@adtac
Member Author

adtac commented Dec 14, 2020

/assign @wojtek-t

preset-service-account: "true"
preset-k8s-ssh: "true"
preset-dind-enabled: "true"
preset-e2e-kubemark-common: "true"
Member

I'm fine with starting with kubemark, but eventually we should consider migrating to real clusters. Kubemark is still visibly different than real clusters (easier from the point-of-view of control-plane).

@wojtek-t
Member

@adtac - next time please ping me if I'm not responding for a week or more.

/lgtm
/approve

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Dec 16, 2020
@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: adtac, wojtek-t

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Dec 16, 2020
@adtac
Member Author

adtac commented Dec 21, 2020

/hold cancel

@k8s-ci-robot k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Dec 21, 2020
@k8s-ci-robot k8s-ci-robot merged commit aafaaeb into kubernetes:master Dec 21, 2020
@k8s-ci-robot k8s-ci-robot added this to the v1.21 milestone Dec 21, 2020
@k8s-ci-robot
Contributor

@adtac: Updated the job-config configmap in namespace default at cluster test-infra-trusted using the following files:

  • key sig-scalability-periodic-jobs.yaml using file config/jobs/kubernetes/sig-scalability/sig-scalability-periodic-jobs.yaml

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

- --scenario=kubernetes_e2e
- --
- --cluster=kubemark-100-scheduler-highqps
- --build=bazel
Member

is there a reason to be checking out kubernetes and doing a source build here?
most CI jobs consume our existing builds (--extract=ci/latest or similar)

this is cheaper and faster
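Under that suggestion, the job's args above would swap the source build for a prebuilt CI release. A sketch, assuming the conventional kubetest extract channel mentioned in the comment:

```yaml
- --scenario=kubernetes_e2e
- --
- --cluster=kubemark-100-scheduler-highqps
# Consume an existing CI build instead of building from source.
- --extract=ci/latest
```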

Member Author

not really, but IIRC to test the PR I had to build from my fork and I must have forgotten to change back to --extract=ci/latest. I'll open a PR to switch back to reusing existing builds.

Member

Ack thanks!
I'm working on switching over how we build (bazel=>quick) but suspected this one shouldn't be building anyhow

Member

#21062 covers this

Member Author

thanks @BenTheElder!
