Vulkan on AMD Ryzen AI APU/iGPU generates worse images than CPU, or just colorful noise

When I run stable-diffusion.cpp with Vulkan on a Ryzen AI 9 HX 370 (Radeon 890M iGPU), the resulting images are very different from what I get when running on CPU with the AVX2 build.  Some comparison pics follow.

### SDXL
For reference, the below pic is what I get from SDXL on my CPU if I prompt as follows:
`sd -m sd_xl_base_1.0.safetensors --vae sdxl.vae.safetensors -H 1024 -W 1024 -p "a lovely cat"`
(Note that I needed to use madebyollin's fp16 vae to get an output that isn't all black.)
![sd-cpp-avx2_vae-fp16_cat_1024x1024_output](https://github.com/user-attachments/assets/0da80667-eaac-49ea-9b7a-ce46d61f397d)

And below is what I get from SDXL on my GPU using Vulkan:
`sd -m sd_xl_base_1.0.safetensors --vae sdxl.vae.safetensors --vae-on-cpu -H 1024 -W 1024 -p "a lovely cat"`
(Note that running VAE on the CPU versus tiled on the GPU produces essentially the same-looking image below.  Attempting to run on GPU without tiling fails when it requests an excessive amount of memory, as described in [stduhpf's comment here](https://github.com/leejet/stable-diffusion.cpp/pull/291#issuecomment-2297736533).)
![sd-cpp-vulkan_vae-fp16-on-cpu_cat_1024x1024_output](https://github.com/user-attachments/assets/9b364b2a-e600-483f-b0c1-b957d9be5463)

### SD 1.5
With SD 1.5, Vulkan at least produces actual cat pictures, but they are blurry or deformed compared to CPU.

For reference, below is what I get from the CPU for the following prompt:
`sd -m v1-5-pruned-emaonly.safetensors -p "a lovely cat"`
![sd-cpp-avx2_cat_output](https://github.com/user-attachments/assets/1c619cd4-57f8-40d2-81af-3d25edf62ec0)

And below is what I get from the GPU with Vulkan:
`sd -m v1-5-pruned-emaonly.safetensors -p "a lovely cat"`
(I also tried running this with the VAE on the CPU, but it gives the same cat below with no apparent visual difference.)
![sd-cpp-vulkan_cat_output](https://github.com/user-attachments/assets/fb600333-0c35-4197-9752-32a62d10b838)

Finally, running clip on the CPU gives a different, more-deformed cat:
`sd -m v1-5-pruned-emaonly.safetensors --vae-on-cpu --clip-on-cpu -p "a lovely cat"`
![sd-cpp-vulkan_vae+clip-on-cpu_cat_output](https://github.com/user-attachments/assets/d67bc1cb-b956-4f5f-a0fc-d12cecbb8cdb)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Vulkan on AMD Ryzen AI APU/iGPU generates worse images than CPU, or just colorful noise #563

SDXL

SD 1.5

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Vulkan on AMD Ryzen AI APU/iGPU generates worse images than CPU, or just colorful noise #563

Description

SDXL

SD 1.5

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions