Fix segfault in moe-expert-reduce test in support mode and coverage #16936
@am17an Please review
Thanks for the fix!
It would be better to filter out fusion cases in the supports test, and also to initialize `gf` in `eval_support`.
Thanks @slaren, I incorporated your suggestions. Please let me know if this was the intended approach.
@slaren Thank you for your guidance and feedback. Please have a look.
* origin/master: (169 commits)
  opencl: support imrope (ggml-org#16914)
  fix: Viewing multiple PDF attachments (ggml-org#16974)
  model-conversion : pass config to from_pretrained (ggml-org#16963)
  server : add props.model_alias (ggml-org#16943)
  ggml: CUDA: add head size 72 for flash-attn (ggml-org#16962)
  mtmd: add --image-min/max-tokens (ggml-org#16921)
  mtmd: pad mask for qwen2.5vl (ggml-org#16954)
  ggml : LoongArch fixes (ggml-org#16958)
  sync: minja (glm 4.6 & minmax m2 templates) (ggml-org#16949)
  SYCL: optimized repeat_back kernel (3× fewer asm instructions, 2× faster) Feature/sycl repeat back opt (ggml-org#16869)
  feat(webui): improve LaTeX rendering with currency detection (ggml-org#16508)
  test-backend-ops : fix segfault in moe-expert-reduce test in support mode and coverage (ggml-org#16936)
  ci : disable failing riscv cross build (ggml-org#16952)
  model: add Janus Pro for image understanding (ggml-org#16906)
  clip : use FA (ggml-org#16837)
  server : support unified cache across slots (ggml-org#16736)
  common : move gpt-oss reasoning processing to init params (ggml-org#16937)
  docs: remove llama_sampler_accept reference in sampling sample usage (ggml-org#16920)
  CUDA: add FLOOR, CEIL, ROUND, TRUNC unary ops (ggml-org#16917)
  devops: fix failing s390x docker build (ggml-org#16918)
  ...
This PR fixes a segmentation fault that occurs while running the `test-backend-ops` tool in `support` mode or with the `--show-coverage` flag. This will also allow `docs/ops.md` to be updated for tracking #14909, as it needs the results from `support` mode.

Root Cause

In this flow, testing does not initialize `gf` (`ggml_cgraph`); it calls the `build_graph` method for each test case. The `test_moe_expert_reduce` test case calls `ggml_build_forward_expand(gf, ...)` inside its `build_graph` method, but `gf` is `nullptr` in this flow, which causes the segmentation fault.

Solution

Wrap the `ggml_build_forward_expand` call in a `gf` null check.
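
For illustration only, the null-check pattern described in the Solution could look roughly like the sketch below. It does not use the real ggml headers: `Tensor`, `Graph`, `build_forward_expand`, and this `build_graph` signature are hypothetical stand-ins for `ggml_tensor`, `ggml_cgraph`, `ggml_build_forward_expand`, and the test case's method, so the actual change in `test-backend-ops` may differ.

```cpp
#include <vector>

// Hypothetical stand-ins for the ggml types; NOT the real ggml API.
struct Tensor { int id; };
struct Graph  { std::vector<Tensor *> nodes; };

// Stand-in for ggml_build_forward_expand: it dereferences the graph
// unconditionally, so calling it with a null graph is undefined behavior
// (in practice, a segmentation fault).
void build_forward_expand(Graph * gf, Tensor * t) {
    gf->nodes.push_back(t);
}

// Stand-in for a test case's build_graph method.
// gf may be nullptr when only op support is queried and no graph is evaluated.
Tensor * build_graph(Graph * gf, Tensor * out) {
    if (gf != nullptr) {
        // Only expand the graph when one was actually provided.
        build_forward_expand(gf, out);
    }
    return out;
}
```

With the guard in place, the same `build_graph` can be called from a flow that passes `nullptr` and from one that passes a real graph; only the latter records the node.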