-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Bug 1977100: retry setting CPU load balancing #5786
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug 1977100: retry setting CPU load balancing #5786
Conversation
|
Hi @cynepco3hahue. Thanks for your PR. I'm waiting for a cri-o member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
c7c874d to
5502351
Compare
|
@haircommander Can you please review it? |
| // TODO: re-visit once we will have some more acceptable cgroups hierarchy to disable CPU load balancing | ||
| // correctly via cgroups, see -https://bugzilla.redhat.com/show_bug.cgi?id=1946801 | ||
| return wait.PollImmediate(time.Second, 5*time.Second, func() (bool, error) { | ||
| if err := setCPUSLoadBalancing(c, enable, schedDomainDir); err != nil { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
should we only do this if os.IsNotExist(err)?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes it makes sense
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done
It possible that the kernel will rebuild sched_domain related files and because of it enabling or disabling CPU load balancing for container CPUs will fail with different file errors: 1. lstat /proc/sys/kernel/sched_domain/cpu22/domain1/flags: no such file or directory 2. readdirent /proc/sys/kernel/sched_domain/cpu66/domain0: no such file or directory Add retry logic around setting CPU load balancing values to reduce possibility of such errors. Signed-off-by: Artyom Lukianov <[email protected]>
5502351 to
1098cc9
Compare
|
/approve LGTM |
|
@haircommander Can you please add |
|
/ok-to-test |
|
/retest |
1 similar comment
|
/retest |
|
@haircommander Do you know whom can I ask for an additional review? |
saschagrunert
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good one!
/lgtm
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: cynepco3hahue, haircommander, saschagrunert The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Thanks! |
|
/test e2e-gcp |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
|
/cherry-pick release-1.21 |
|
@kolyshkin: new pull request created: #5919 DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/cherry-pick release-1.22 |
|
@kolyshkin: new pull request created: #5920 DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/cherry-pick release-1.23 |
|
@kolyshkin: new pull request created: #5921 DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
What type of PR is this?
/kind bug
What this PR does / why we need it:
It is possible that the kernel will rebuild sched_domain related files and
because of it enabling or disabling CPU load balancing for container CPUs
will fail with different file errors:
Add retry logic around setting CPU load balancing values to reduce the possibility of such errors.
Signed-off-by: Artyom Lukianov [email protected]