Pull requests: ggml-org/llama.cpp
Pull requests list
SYCL: Avoid using SYCL-Graph for unsupported nodes
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13587 opened May 16, 2025 by EwanC
CUDA: skip fully masked-out KV in FA vec kernel
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13584 opened May 16, 2025 by JohannesGaessler
server : separate the notion of position and KV tokens, remove prompt truncation
breaking change
Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility.
examples
python
python script changes
server
#13576 opened May 15, 2025 by ngxson
gguf-py : add support for sub_type (in arrays) in GGUFWriter add_key_value method
python
python script changes
#13561 opened May 15, 2025 by CISC
vulkan: move common FA code to flash_attn_base.comp
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#13556 opened May 15, 2025 by jeffbolznv
vulkan: use scalar FA rather than coopmat2 when N==1
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#13554 opened May 15, 2025 by jeffbolznv
Granite Four
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
testing
Everything test related
#13550 opened May 14, 2025 by gabe-l-hart (Draft, 2 tasks)
sycl : reviewing the backend documentation
documentation
Improvements or additions to documentation
examples
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13544 opened May 14, 2025 by Alcpz
sycl: disable reorder for sycl mulmat
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13536 opened May 14, 2025 by sgeor255
ci : upgraded oneAPI version in SYCL workflows and dockerfile
devops
improvements to build systems and github actions
#13532 opened May 14, 2025 by Alcpz
cuda: set cuda compiler path (#13527)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13528 opened May 14, 2025 by lizhenneng
webui: Add editing assistant messages (#11849)
examples
server
#13522 opened May 14, 2025 by lr1729
convert: Swap GLM4 EOS / EOT token
python
python script changes
#13505 opened May 13, 2025 by henk717
llama: Add configuration presets for chat and reranking servers
#13462 opened May 12, 2025 by heyyymonth
Break down main function in llama-server
examples
server
#13425 opened May 10, 2025 by ericcurtin
Update README.md for using llama.cpp in Microsoft Word locally
#13401 opened May 9, 2025 by GPTLocalhost