Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Tags: wanghqc/llama.cpp

Tags

b8018

Toggle b8018's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
vendor : update cpp-httplib (ggml-org#19537)

Signed-off-by: Adrien Gallouët <[email protected]>

b7980

Toggle b7980's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
mtmd: Implement tiling for LFM2-VL (ggml-org#19454)

b7926

Toggle b7926's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
vulkan: disable coopmat1 fa on Nvidia Turing (ggml-org#19290)

b7616

Toggle b7616's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
vulkan: Optimize GGML_OP_CUMSUM (ggml-org#18417)

* vulkan: Optimize GGML_OP_CUMSUM

There are two paths: The preexisting one that does a whole row per workgroup
in a single shader, and one that splits each row into multiple blocks and does
two passes. The first pass computes partials within a block, the second adds
the block partials to compute the final result. The multipass shader is used
when there are a small number of large rows.

In the whole-row shader, handle multiple elements per invocation.

* use 2 ELEM_PER_THREAD for AMD/Intel

* address feedback

b7585

Toggle b7585's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
common : default content to an empty string (ggml-org#18485)

* common : default content to an empty string

* common : fix tests that break when content != null

b7577

Toggle b7577's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
webui: fix prompt progress ETA calculation (ggml-org#18468)

* webui: fix prompt progress ETA calculation

* handle case done === 0

b7170

Toggle b7170's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
server: enable jinja by default, update docs (ggml-org#17524)

* server: enable jinja by default, update docs

* fix tests

b6560

Toggle b6560's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
ci : disable AMD workflows + update NVIDIA workflows (ggml-org#16200)

* ci : disable AMD workflows + update NVIDIA workflows

* cont : fixes

* cont : update nvidia vulkan workflows

b6209

Toggle b6209's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
opencl: mark `argsort` unsupported if cols exceed workgroup limit (gg…

…ml-org#15375)

b6199

Toggle b6199's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
mtmd : clean up clip_n_output_tokens (ggml-org#15391)