feat: remove normalize score section in the scheduling framwork #1178

draveness · 2019-07-29T15:42:02Z

/sig scheduling
/priority important-soon
/kind feature

As discussed in kubernetes/kubernetes#80383, we decided to merge the normalize score extension point into the score extension point.

/assign @bsalamat @ahg-g

draveness · 2019-07-29T15:46:09Z

/remove-kind feature
/cc @liu-cong

k8s-ci-robot · 2019-07-29T15:46:10Z

@draveness: GitHub didn't allow me to request PR reviews from the following users: liu-cong.

Note that only kubernetes members and repo collaborators can review this PR, and authors cannot review their own PRs.

In response to this:

/remove-kind feature
/cc @liu-cong

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

draveness · 2019-07-30T02:13:34Z

@bsalamat Comments have been addressed, PTAL.

draveness · 2019-07-30T02:15:06Z

BTW: do we need to remove the normalize score in this image?

ahg-g

Thanks Draven

ahg-g · 2019-07-31T01:51:42Z

keps/sig-scheduling/20180409-scheduling-framework.md

+1. The first phase is called "score" which is used to rank nodes that have passed 
+the filtering phase. The scheduler will call `Score` of each scoring plugin for
+each node. There will be a well defined range of integers representing the minimum
+and maximum scores. 


@bsalamat we don't currently enforce this range. Is it supposed to be enforced before or after normalization (if it exists)?

If we refactor InterPodAffinity to use the scheduling framework and the score-normalize pattern, we can't enforce this in the "score" phase.

We could reword to "the scoring plugin will return a node to score map after the score and normalize score phases, which will be a well-defined range of integers..."

But it would be a little more complicated for the user to understand.

We can enforce the range at the time we apply the weights.

Yes, we could do that, but developers are supposed to decide how to enforce the range in this extension point, e.g., by rescaling with the ratio 10*(current-min)/(max-min).
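
As a rough illustration of the ratio mentioned above, here is a minimal Go sketch of min-max rescaling. The names `rescale` and `maxNodeScore` are hypothetical and not taken from the scheduler code, and the upper bound of 10 is only the range under discussion at this point in the thread.

```go
package main

import "fmt"

// maxNodeScore is a hypothetical upper bound for this sketch; the thread
// discusses both 10 and 100 as candidates.
const maxNodeScore int64 = 10

// rescale maps raw scores linearly onto [0, maxNodeScore] using
// maxNodeScore*(current-min)/(max-min), with integer division.
// When all raw scores are equal the ratio is undefined, so this sketch
// simply returns zeros; a real plugin might choose differently.
func rescale(raw []int64) []int64 {
	out := make([]int64, len(raw))
	if len(raw) == 0 {
		return out
	}
	lo, hi := raw[0], raw[0]
	for _, s := range raw[1:] {
		if s < lo {
			lo = s
		}
		if s > hi {
			hi = s
		}
	}
	if hi == lo {
		return out // avoid division by zero
	}
	for i, s := range raw {
		out[i] = maxNodeScore * (s - lo) / (hi - lo)
	}
	return out
}

func main() {
	fmt.Println(rescale([]int64{5, 20, 35})) // prints [0 5 10]
}
```

Note that this kind of rescaling needs the scores of all nodes at once, which is why it cannot live in a per-node scoring call.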

Yes, they can do that, but the framework should also either error out when the score returned by a plugin is not within the range, or apply a simple default flooring/ceiling logic.
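
The two framework-side options mentioned here (error out, or floor/ceil) could look roughly like the sketch below. All names (`checkRange`, `clamp`, `weighted`) and the [0, 10] bounds are assumptions made for illustration, not the framework's actual code.

```go
package main

import "fmt"

// Hypothetical bounds for this sketch; the thread discusses [0, 10] and [0, 100].
const (
	minNodeScore int64 = 0
	maxNodeScore int64 = 10
)

// checkRange is the "error out" option: reject any score outside the range.
func checkRange(plugin string, score int64) error {
	if score < minNodeScore || score > maxNodeScore {
		return fmt.Errorf("plugin %q returned score %d, want a value in [%d, %d]",
			plugin, score, minNodeScore, maxNodeScore)
	}
	return nil
}

// clamp is the "default flooring/ceiling" option: pull the score back
// to the nearest bound instead of failing.
func clamp(score int64) int64 {
	if score < minNodeScore {
		return minNodeScore
	}
	if score > maxNodeScore {
		return maxNodeScore
	}
	return score
}

// weighted enforces the range at the moment the weight is applied.
func weighted(plugin string, score, weight int64) (int64, error) {
	if err := checkRange(plugin, score); err != nil {
		return 0, err
	}
	return score * weight, nil
}

func main() {
	v, err := weighted("example", 7, 2)
	fmt.Println(v, err) // prints 14 <nil>

	_, err = weighted("example", 42, 2)
	fmt.Println(err != nil) // prints true

	fmt.Println(clamp(42), clamp(-3)) // prints 10 0
}
```

Erroring out surfaces plugin bugs early, while clamping silently hides them; which is preferable is exactly the trade-off being debated in this thread.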

we shouldn't call it by default

+1

If we set the range of scores to [0, 100], we should multiply the framework's score by 0.1 or the priority functions' score by 10.

We could also keep the current range [0, 10], which doesn't need the annoying multiplication. And I don't think [0, 10] versus [0, 100] would make a difference here.

I don't have a reason to change the range from [0, 10] to [0, 100], unless past experience indicates that the [0, 10] range is too tight. Is that the case, @bsalamat?

We have seen many instances where our scoring functions are unpredictable. While extending the range will not address this problem, it can be one step towards making the score functions more differentiating. One can imagine that in a large cluster with thousands of nodes, a [0, 10] range is not differentiating enough. Now that we are transitioning to the framework, we can consider expanding the range.

OK, I'll multiply the original score by 10.

liu-cong

/lgtm to me overall.

Thanks Draven!

ahg-g · 2019-08-06T10:22:33Z

@draveness can you please add a new commit when making new changes to the PR instead of amending? This helps reviewers see what has changed since the last review.

draveness · 2019-08-06T10:24:55Z

Sure, I'll do that next time.

draveness · 2019-08-07T02:51:04Z

To summarize, the consensus we reached is as follows:

1. A Score function should return a value in the range [0, 100] unless a NormalizeScore function is provided.
2. A NormalizeScore function, when provided, should return a value in the range [0, 100].
3. Implementing NormalizeScore is optional. The framework could provide a default implementation, but the plugin should call it explicitly (we shouldn't call it by default). See feat: remove normalize score section in the scheduling framwork #1178 (comment).
4. Multiply the score returned by each priority function by 10. See feat: remove normalize score section in the scheduling framwork #1178 (comment).
5. Use a named array instead of a score array in the normalize score function, per the discussion in feat: use named array instead of array in normalizing score kubernetes#80901 (review).

Is there anything wrong or missing? If not, I'll update the KEP and related PRs to reflect these changes.

/cc @bsalamat @ahg-g

bsalamat · 2019-08-07T17:36:36Z

> To summarize, the consensus we reached is as follows:
>
> A Score function should return a value in the range [0, 100] unless a NormalizeScore function is provided.
>
> A NormalizeScore function, when provided, should return a value in the range [0, 100].

I would rephrase these two items as:

The output of a score plugin must be an integer in the range [0, 100]. This is the output after running the optional NormalizeScore function of the plugin. If NormalizeScore is not provided, the output of Score must be in this range.

The reason that I propose this minor change is that right after running Score functions, we do not want to check whether a NormalizeScore is provided.
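
One way to read this proposal: the framework runs Score for every node, runs NormalizeScore only if the plugin has one, and checks the range a single time at the end. The Go sketch below illustrates that ordering under stated assumptions. Every type and function name in it (`ScorePlugin`, `ScoreNormalizer`, `runScorePlugin`, `fixedPlugin`, `cappedPlugin`) is hypothetical and is not the framework's real API.

```go
package main

import "fmt"

const maxNodeScore int64 = 100 // assumed range [0, 100] from the proposal

// NodeScore pairs a node name with its score (hypothetical type).
type NodeScore struct {
	Name  string
	Score int64
}

// ScorePlugin is a hypothetical per-node scoring interface.
type ScorePlugin interface {
	Name() string
	Score(node string) (int64, error)
}

// ScoreNormalizer is the optional part; plugins may or may not implement it.
type ScoreNormalizer interface {
	NormalizeScore(scores []NodeScore) error
}

// runScorePlugin validates the range once, after the optional normalization,
// so it never has to ask "was a normalizer provided?" right after Score.
func runScorePlugin(p ScorePlugin, nodes []string) ([]NodeScore, error) {
	scores := make([]NodeScore, 0, len(nodes))
	for _, n := range nodes {
		s, err := p.Score(n)
		if err != nil {
			return nil, err
		}
		scores = append(scores, NodeScore{Name: n, Score: s})
	}
	if norm, ok := p.(ScoreNormalizer); ok {
		if err := norm.NormalizeScore(scores); err != nil {
			return nil, err
		}
	}
	for _, ns := range scores {
		if ns.Score < 0 || ns.Score > maxNodeScore {
			return nil, fmt.Errorf("plugin %q: score %d for node %q is outside [0, %d]",
				p.Name(), ns.Score, ns.Name, maxNodeScore)
		}
	}
	return scores, nil
}

// fixedPlugin returns the same raw score for every node and has no normalizer.
type fixedPlugin struct{ value int64 }

func (f fixedPlugin) Name() string                { return "fixed" }
func (f fixedPlugin) Score(string) (int64, error) { return f.value, nil }

// cappedPlugin reuses fixedPlugin but adds a trivial normalizer.
type cappedPlugin struct{ fixedPlugin }

func (cappedPlugin) NormalizeScore(scores []NodeScore) error {
	for i := range scores {
		if scores[i].Score > maxNodeScore {
			scores[i].Score = maxNodeScore
		}
	}
	return nil
}

func main() {
	_, err := runScorePlugin(fixedPlugin{value: 250}, []string{"node-a"})
	fmt.Println(err != nil) // prints true: no normalizer, raw score out of range

	scores, err := runScorePlugin(cappedPlugin{fixedPlugin{value: 250}}, []string{"node-a"})
	fmt.Println(scores, err) // prints [{node-a 100}] <nil>
}
```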

> Implementing NormalizeScore is optional. The framework could provide a default implementation, but the plugin should call it explicitly (we shouldn't call it by default). See feat: remove normalize score section in the scheduling framwork #1178 (comment).

As @ahg-g pointed out, the framework will not provide a default implementation. When the output of a score plugin without NormalizeScore is in the [0, 100] range, there is no need to run any normalization.

> Multiply the score returned by each priority function by 10. See feat: remove normalize score section in the scheduling framwork #1178 (comment).

That's one option. However, these are our internal functions. We could change them to provide finer-grained scores.

> Use a named array instead of a score array in the normalize score function, per the discussion in kubernetes/kubernetes#80901 (review).
>
> Is there anything wrong or missing? If not, I'll update the KEP and related PRs to reflect these changes.

liu-cong · 2019-08-12T14:42:10Z

> A NormalizeScore is optional to implement, the framework could provide a default implementation, but it should explicitly be called by the plugin (we shouldn't call it by default). feat: remove normalize score section in the scheduling framwork #1178 (comment)
>
> As @ahg-g pointed, the framework will not provide a default implementation. When the output of a score plugin without NormalizeScore is in the [0,100] range, there is no need to run any normalization.

@bsalamat, it seems that the purpose of NormalizeScore is to make sure the result is in the [0, 100] range. It's the ScorePlugin developer's responsibility anyway to make sure the score results are in the range. They can choose whatever approach they want to do this, be it using a normalizeScore helper function, embedding it in the score calculation, etc. If that's the case, why do we provide NormalizeScore as a public interface? In my opinion, this forces the developer to make one more decision, which is not necessary.

Did I miss anything?

ahg-g · 2019-08-12T15:37:42Z

The difference between Score and NormalizeScore is the interface: the former evaluates one node at a time, while the latter evaluates all the nodes (normalization could require a global view). It is a pattern that I assume was observed frequently and warranted explicit support from the framework.
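
To make the per-node versus all-nodes distinction concrete, here is a small Go sketch. The plugin (`podCountPlugin`), the `NodeScoreList` type, and the method signatures are invented for illustration and are simplified compared with whatever the framework actually defines. Score can only see one node, so it returns a raw count; NormalizeScore sees every node's raw score and can scale relative to the highest one.

```go
package main

import "fmt"

const maxNodeScore int64 = 100 // assumed upper bound for this sketch

// NodeScore and NodeScoreList are hypothetical "named array" types.
type NodeScore struct {
	Name  string
	Score int64
}

type NodeScoreList []NodeScore

// podCountPlugin is a made-up plugin that prefers nodes with more matching pods.
type podCountPlugin struct {
	podsPerNode map[string]int64
}

// Score evaluates a single node in isolation and returns a raw, unbounded count.
func (p podCountPlugin) Score(node string) int64 {
	return p.podsPerNode[node]
}

// NormalizeScore gets the global view: it finds the highest raw score across
// all nodes and scales every entry into [0, maxNodeScore] in place.
func (p podCountPlugin) NormalizeScore(scores NodeScoreList) {
	var highest int64
	for _, s := range scores {
		if s.Score > highest {
			highest = s.Score
		}
	}
	if highest == 0 {
		return // nothing to scale
	}
	for i := range scores {
		scores[i].Score = scores[i].Score * maxNodeScore / highest
	}
}

func main() {
	p := podCountPlugin{podsPerNode: map[string]int64{"node-a": 3, "node-b": 12}}

	var scores NodeScoreList
	for _, n := range []string{"node-a", "node-b"} {
		scores = append(scores, NodeScore{Name: n, Score: p.Score(n)})
	}
	p.NormalizeScore(scores)

	fmt.Println(scores) // prints [{node-a 25} {node-b 100}]
}
```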

draveness · 2019-08-19T06:43:24Z

I've updated the score section and kept the current range. In addition, I created kubernetes/kubernetes#81281 to discuss the path to expanding the range to [0, 100]; we can update the KEP after that discussion.

@bsalamat @ahg-g Please take another look

draveness · 2019-08-19T12:28:52Z

@ahg-g comments have been addressed, please take another look

ahg-g · 2019-08-19T13:31:16Z

Thanks, there is still one instance of "NodeScoreMax"

draveness · 2019-08-19T17:29:29Z

done

ahg-g · 2019-08-20T15:19:23Z

/lgtm

please squash the commits

draveness · 2019-08-20T15:23:42Z

Done, friendly ping @bsalamat for final approval.

bsalamat

/lgtm
/approve

Thanks, @draveness!

k8s-ci-robot · 2019-08-20T17:47:13Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ahg-g, bsalamat, draveness, liu-cong

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~keps/sig-scheduling/OWNERS~~ [bsalamat]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot assigned ahg-g and bsalamat Jul 29, 2019

k8s-ci-robot requested review from bgrant0607 and k82cn July 29, 2019 15:42

k8s-ci-robot removed the kind/feature Categorizes issue or PR as related to a new feature. label Jul 29, 2019

bsalamat reviewed Jul 29, 2019

draveness force-pushed the feature/update-normalize-score branch from ec94458 to 6e5cb9b Compare July 30, 2019 02:12

ahg-g reviewed Jul 31, 2019

draveness mentioned this pull request Jul 31, 2019

Enforce the range of score returned by the score plugin kubernetes/kubernetes#80784

Closed

liu-cong reviewed Aug 1, 2019

draveness force-pushed the feature/update-normalize-score branch from 6e5cb9b to 5bfa707 Compare August 6, 2019 02:01

draveness mentioned this pull request Aug 6, 2019

feat: return error when score is out of range kubernetes/kubernetes#81015

Merged

k8s-ci-robot requested review from ahg-g and bsalamat August 7, 2019 02:51

draveness mentioned this pull request Aug 12, 2019

[Scheduler] Expand the internal range of score to [0, 100] kubernetes/kubernetes#81281

Closed

draveness force-pushed the feature/update-normalize-score branch from 725b3ba to 3cbd7d5 Compare August 19, 2019 06:39

draveness force-pushed the feature/update-normalize-score branch from 3cbd7d5 to e27703a Compare August 19, 2019 06:45

ahg-g reviewed Aug 19, 2019

draveness force-pushed the feature/update-normalize-score branch from 3a154d3 to b2c8d03 Compare August 19, 2019 17:29

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 20, 2019

feat: reword normalize score section in the scheduling framwork

5ff5d20

draveness force-pushed the feature/update-normalize-score branch from 3fd04bc to 5ff5d20 Compare August 20, 2019 15:23

k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 20, 2019

bsalamat reviewed Aug 20, 2019

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 20, 2019

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Aug 20, 2019

k8s-ci-robot merged commit aba897e into kubernetes:master Aug 20, 2019

k8s-ci-robot added this to the v1.17 milestone Aug 20, 2019

liu-cong mentioned this pull request Aug 21, 2019

REQUEST: New membership for liu-cong kubernetes/org#1127

Closed

6 tasks
