Codestin Search App

b8018

vendor : update cpp-httplib (ggml-org#19537)

Signed-off-by: Adrien Gallouët <[email protected]>

Feb 12, 2026
4b385bf
zip
tar.gz

b7980

mtmd: Implement tiling for LFM2-VL (ggml-org#19454)

Feb 9, 2026
262364e
zip
tar.gz

b7926

vulkan: disable coopmat1 fa on Nvidia Turing (ggml-org#19290)

Feb 3, 2026
32b17ab
zip
tar.gz

b7616

vulkan: Optimize GGML_OP_CUMSUM (ggml-org#18417)

* vulkan: Optimize GGML_OP_CUMSUM

There are two paths: The preexisting one that does a whole row per workgroup
in a single shader, and one that splits each row into multiple blocks and does
two passes. The first pass computes partials within a block, the second adds
the block partials to compute the final result. The multipass shader is used
when there are a small number of large rows.

In the whole-row shader, handle multiple elements per invocation.

* use 2 ELEM_PER_THREAD for AMD/Intel

* address feedback

Jan 2, 2026
18ddaea
zip
tar.gz

b7585

common : default content to an empty string (ggml-org#18485)

* common : default content to an empty string

* common : fix tests that break when content != null

Dec 30, 2025
0f89d2e
zip
tar.gz

b7577

webui: fix prompt progress ETA calculation (ggml-org#18468)

* webui: fix prompt progress ETA calculation

* handle case done === 0

Dec 29, 2025
51a4872
zip
tar.gz

b7170

server: enable jinja by default, update docs (ggml-org#17524)

* server: enable jinja by default, update docs

* fix tests

Nov 27, 2025
e509411
zip
tar.gz

b6560

ci : disable AMD workflows + update NVIDIA workflows (ggml-org#16200)

* ci : disable AMD workflows + update NVIDIA workflows

* cont : fixes

* cont : update nvidia vulkan workflows

Sep 23, 2025
f505bd8
zip
tar.gz
Downloads

b6209

opencl: mark `argsort` unsupported if cols exceed workgroup limit (gg…

…ml-org#15375)

Aug 19, 2025
fb22dd0
zip
tar.gz
Downloads

b6199

mtmd : clean up clip_n_output_tokens (ggml-org#15391)

Aug 18, 2025
f08c4c0
zip
tar.gz
Downloads

PreviousNext

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

b8018

b7980

b7926

b7616

b7585

b7577

b7170

b6560

b6209

b6199

Tags: wanghqc/llama.cpp