Pull requests: ggml-org/llama.cpp
Add support for TRUNC unary operator (CPU +SYCL)
Ascend NPU
issues specific to Ascend NPUs
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#16032
opened Sep 16, 2025 by
safranowith
cmake : fix static linking for OpenMP on Unix-like systems
ggml
changes relating to the ggml tensor library for machine learning
#16031
opened Sep 16, 2025 by
angt
Add support for Ling v2
python
python script changes
#16028
opened Sep 16, 2025 by
im0qianqian
llama-quant : fix the verification of attention layers for encoder-decoder models
#16023
opened Sep 16, 2025 by
DamonFool
[WIP] Rpc split row
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
#16020
opened Sep 16, 2025 by
LeaveNhA
GGML WebGPU: Support for ADD, MUL, RMS_NORM, GET_ROWS operators
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
#16018
opened Sep 15, 2025 by
reeselevine
Deterministic inference mode (CUDA): RMSNorm, MatMul, Attention, KV-cache
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
script
Script related
testing
Everything test related
Add Olmo3 implementation
python
python script changes
#16015
opened Sep 15, 2025 by
2015aroras
Guard ThreadPowerThrottling for non-MSVC builds
ggml
changes relating to the ggml tensor library for machine learning
#16014
opened Sep 15, 2025 by
B1rds3y
Add ROUND operator support for CPU and SYCL backends
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#16011
opened Sep 15, 2025 by
safranowith
SYCL: Add GGML_OP_MEAN operator support
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16009
opened Sep 15, 2025 by
yael-works
ci : create git tags for released docker images
devops
improvements to build systems and github actions
#16008
opened Sep 15, 2025 by
rgerganov
SYCL/SET: Implement and document full support for SET operator
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16006
opened Sep 15, 2025 by
GittyBurstein
Add CEIL operator support for CPU and SYCL backends
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#16005
opened Sep 15, 2025 by
safranowith
examples : support encoder-decoder models in the simple example
examples
#16002
opened Sep 15, 2025 by
DamonFool
--numa mirror: mirror model weights to every Numa node in the system
Apple Metal
metal : refactor + optimize v2
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
vulkan : shader development improvements
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#15993
opened Sep 14, 2025 by
Acly
ggml: add FLOOR unary op (CPU + SYCL)
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#15989
opened Sep 14, 2025 by
safranowith
metal : use virtual GPU address for private buffers
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
CUDA: fix FA occupancy, optimize tile kernel
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#15982
opened Sep 14, 2025 by
JohannesGaessler
SYCL: Add ARANGE operator with GPU kernel, tests, and documentation
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#15978
opened Sep 14, 2025 by
GittyBurstein