sycl: Remove not needed copy f16->f32 for dnnl mul mat #14125

ShanoToni · 2025-06-11T11:45:33Z

PR proposes when GGML_SYCL_F16=ON to allow oneDNN to handle conversion and outputting of mul_mat into f32 and enabling fpmathmode to f16.

The current approach uses the memory pool to pass a f16 dst for the oneDNN matmul and a cpy from f16 to the actual f32 output dst_dd_i
Example of performance difference observed:

Lunar Lake

current approach:

model	size	params	backend	ngl	threads	test	t/s
qwen2 1.5B Q4_0	1013.62 MiB	1.78 B	SYCL	99	8	pp512	1475.42 ± 43.91
qwen2 1.5B Q4_0	1013.62 MiB	1.78 B	SYCL	99	8	tg128	37.46 ± 0.38

proposed changes:

model	size	params	backend	ngl	threads	test	t/s
qwen2 1.5B Q4_0	1013.62 MiB	1.78 B	SYCL	99	8	pp512	1566.90 ± 26.33
qwen2 1.5B Q4_0	1013.62 MiB	1.78 B	SYCL	99	8	tg128	37.66 ± 0.49

Battlemage

current approach:

model	size	params	backend	ngl	test	t/s
qwen2 1.5B Q4_0	1013.62 MiB	1.78 B	SYCL	99	pp512	7424.59 ± 15.58
qwen2 1.5B Q4_0	1013.62 MiB	1.78 B	SYCL	99	tg128	99.61 ± 2.09

proposed changes:

model	size	params	backend	ngl	test	t/s
qwen2 1.5B Q4_0	1013.62 MiB	1.78 B	SYCL	99	pp512	8148.99 ± 35.10
qwen2 1.5B Q4_0	1013.62 MiB	1.78 B	SYCL	99	tg128	100.62 ± 2.05

Rbiessy · 2025-06-12T13:15:04Z

CI failure is unrelated so merging now

sycl: Remove not needed copy f16->f32 for dnnl mul mat

6e07852

github-actions bot added ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language labels Jun 11, 2025

Alcpz approved these changes Jun 11, 2025

View reviewed changes

AD2605 approved these changes Jun 11, 2025

View reviewed changes

Rbiessy approved these changes Jun 11, 2025

View reviewed changes

Rbiessy merged commit ed52f36 into ggml-org:master Jun 12, 2025
121 of 129 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

sycl: Remove not needed copy f16->f32 for dnnl mul mat #14125

sycl: Remove not needed copy f16->f32 for dnnl mul mat #14125

Uh oh!

ShanoToni commented Jun 11, 2025 •

edited

Loading

Uh oh!

Rbiessy commented Jun 12, 2025

Uh oh!

Uh oh!

Uh oh!

sycl: Remove not needed copy f16->f32 for dnnl mul mat #14125

sycl: Remove not needed copy f16->f32 for dnnl mul mat #14125

Uh oh!

Conversation

ShanoToni commented Jun 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Lunar Lake

Battlemage

Uh oh!

Rbiessy commented Jun 12, 2025

Uh oh!

Uh oh!

Uh oh!

ShanoToni commented Jun 11, 2025 •

edited

Loading