Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

SYCL: Avoid using SYCL-Graph for unsupported nodes ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13587 opened May 16, 2025 by EwanC Loading…
CUDA: skip fully masked-out KV in FA vec kernel ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#13584 opened May 16, 2025 by JohannesGaessler Loading…
server : separate the notion of position and KV tokens, remove prompt truncation breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. examples python python script changes server
#13576 opened May 15, 2025 by ngxson Loading…
Update python verions examples python python script changes server
#13574 opened May 15, 2025 by robbiemu Loading…
gguf-py : add support for sub_type (in arrays) in GGUFWriter add_key_value method python python script changes
#13561 opened May 15, 2025 by CISC Loading…
vulkan: move common FA code to flash_attn_base.comp ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#13556 opened May 15, 2025 by jeffbolznv Loading…
vulkan: use scalar FA rather than coopmat2 when N==1 ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#13554 opened May 15, 2025 by jeffbolznv Loading…
Granite Four Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning python python script changes testing Everything test related
#13550 opened May 14, 2025 by gabe-l-hart Draft
2 tasks
sycl : reviewing the backend documentation documentation Improvements or additions to documentation examples SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13544 opened May 14, 2025 by Alcpz Loading…
Fix build on OpenBSD examples
#13541 opened May 14, 2025 by percypiper Loading…
sycl: disable reorder for sycl mulmat ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13536 opened May 14, 2025 by sgeor255 Loading…
ci : upgraded oneAPI version in SYCL workflows and dockerfile devops improvements to build systems and github actions
#13532 opened May 14, 2025 by Alcpz Loading…
cuda: set cuda compiler path (#13527) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#13528 opened May 14, 2025 by lizhenneng Loading…
convert: Swap GLM4 EOS / EOT token python python script changes
#13505 opened May 13, 2025 by henk717 Loading…
[SYCL] Overcoming workaround for mmap() allocation on Windows and remove useless wait examples ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13482 opened May 12, 2025 by s-Nick Loading…
docker : enable RPC for docker images devops improvements to build systems and github actions
#13474 opened May 12, 2025 by rgerganov Draft
Support Seed-Coder chat template
#13472 opened May 12, 2025 by yeahdongcn Loading…
2 tasks done
Webui dynamic config examples server
#13429 opened May 10, 2025 by ServeurpersoCom Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.