Codestin Search App

b5621

rpc : nicer error messages for RPC server crash (ggml-org#14076)

Jun 10, 2025
2bb0467
zip
tar.gz

b5620

sync : ggml

ggml-ci

Jun 10, 2025
b8e2194
zip
tar.gz

b5618

metal : use less stack memory in FA kernel (ggml-org#14088)

* metal : use less stack memory in FA kernel

ggml-ci

* cont : fix BF16 variant

Jun 9, 2025
1f63e75
zip
tar.gz

b5617

kv-cache : fix shift and defrag logic (ggml-org#14081)

* kv-cache : fix shift

ggml-ci

* cont : reset shift[i]

ggml-ci

* cont : fix defrag erasing cells that didn't move

ggml-ci

Jun 9, 2025
40cbf57
zip
tar.gz

b5616

llama : allow building all tests on windows when not using shared libs (

ggml-org#13980)

* llama : allow building all tests on windows when not using shared libraries

* add static windows build to ci

* tests : enable debug logs for test-chat

---------

Co-authored-by: Georgi Gerganov <[email protected]>

Jun 9, 2025
7f4fbe5
zip
tar.gz

b5615

ggml-cpu : split arch-specific implementations (ggml-org#13892)

* move ggml-cpu-aarch64 to repack

* split quantize_row_q8_0/1

* split helper functions

* split ggml_vec_dot_q4_0_q8_0

* split ggml_vec_dot_q4_1_q8_1

* split ggml_vec_dot_q5_0_q8_0

* split ggml_vec_dot_q5_1_q8_1

* split ggml_vec_dot_q8_0_q8_0

* split ggml_vec_dot_tq1_0_q8_K

* split ggml_vec_dot_tq2_0_q8_K

* split ggml_vec_dot_q2_K_q8_K

* split ggml_vec_dot_q3_K_q8_K

* split ggml_vec_dot_q4_K_q8_K

* split ggml_vec_dot_q5_K_q8_K

* split ggml_vec_dot_q6_K_q8_K

* split ggml_vec_dot_iq2_xxs_q8_K

* split ggml_vec_dot_iq2_xs_q8_K

* split ggml_vec_dot_iq2_s_q8_K

* split ggml_vec_dot_iq3_xxs_q8_K

* split ggml_vec_dot_iq3_s_q8_K

* split ggml_vec_dot_iq1_s_q8_K

* split ggml_vec_dot_iq1_m_q8_K

* split ggml_vec_dot_iq4_nl_q8_0

* split ggml_vec_dot_iq4_xs_q8_K

* fix typos

* fix missing prototypes

* rename ggml-cpu-quants.c

* rename ggml-cpu-traits

* rename arm folder

* move cpu-feats-x86.cpp

* rename ggml-cpu-hbm

* update arm detection macro in quants.c

* move iq quant tables

* split ggml_quantize_mat_q8_0/K

* split ggml_gemv_*

* split ggml_gemm_*

* rename namespace aarch64 to repack

* use weak aliases to replace test macros

* rename GGML_CPU_AARCH64 to GGML_CPU_REPACK

* rename more aarch64 to repack

* clean up rebase leftover

* fix compilation errors

* remove trailing spaces

* try to fix clang compilation errors

* try to fix clang compilation errors again

* try to fix clang compilation errors, 3rd attempt

* try to fix clang compilation errors, 4th attempt

* try to fix clang compilation errors, 5th attempt

* try to fix clang compilation errors, 6th attempt

* try to fix clang compilation errors, 7th attempt

* try to fix clang compilation errors, 8th attempt

* try to fix clang compilation errors, 9th attempt

* more cleanup

* fix compilation errors

* fix apple targets

* fix a typo in arm version of ggml_vec_dot_q4_K_q8_K

Co-authored-by: Georgi Gerganov <[email protected]>

---------

Co-authored-by: Georgi Gerganov <[email protected]>

Jun 9, 2025
f470bc3
zip
tar.gz

b5614

cuda : fix device sync on buffer clear (ggml-org#14033)

Jun 9, 2025
8f47e25
zip
tar.gz

b5613

graph : fix geglu (ggml-org#14077)

ggml-ci

Jun 9, 2025
201b31d
zip
tar.gz

b5612

CANN: Simplify the environment variable setting(ggml-org#13104)

* Simplify the environment variable setting to specify the memory pool type.

* Adjust the GGML_CANN_ASYNC_MODE setting to accept yes, enable, 1, or on (case-insensitive) as valid options.

* update

* fix CI

* update

* delete whitespace

* fix according to review

* update CANN.md

* update CANN.md

Jun 9, 2025
e21d2d4
zip
tar.gz

b5610

server : fix LRU check (ggml-org#14079)

ggml-ci

Jun 9, 2025
87d34b3
zip
tar.gz

PreviousNext

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

b5621

b5620

b5618

b5617

b5616

b5615

b5614

b5613

b5612

b5610

Tags: pqnet/llama.cpp