Tags: sorasoras/llama.cpp
Tags
Vulkan Embedding Fix (ggml-org#7360) * Fix empty Vulkan host buffers Add fp32 fp16 matmul shader Fix matmul shader alignment * Remove deprecated tensor->backend uses * Fix Vulkan validation errors on embedding models with no offloaded layers * Fix Vulkan llava segfault when not offloading layers
Revert "move ndk code to a new library (ggml-org#6951)" (ggml-org#7282) This reverts commit efc8f76.
PreviousNext