-
Notifications
You must be signed in to change notification settings - Fork 89
Autotune persistent kernels for multi occupancy #1307
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
stack-info: PR: #1307, branch: oulgen/stack/195
54977ac to
94143b9
Compare
stack-info: PR: #1307, branch: oulgen/stack/195
94143b9 to
f922b06
Compare
helion/autotuner/config_spec.py
Outdated
| # Validate num_sm_multiplier is only used with persistent kernels | ||
| pid_type = config["pid_type"] | ||
| num_sm_multiplier = config.get("num_sm_multiplier", 1) | ||
| if num_sm_multiplier != 1 and pid_type in ("flat", "xyz"): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we'd probaby also want to exclude values larger than 1 if careveout/reserved sm is set
jansel
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we may want to combine this with setting maxnreg in Triton to ensure the higher occupancy can launch at the same time.
f922b06 to
4f672c8
Compare
stack-info: PR: #1307, branch: oulgen/stack/195
stack-info: PR: #1307, branch: oulgen/stack/195
4f672c8 to
637e300
Compare
No description provided.