Tags: robyngraf/llama.cpp

Tags

master-d411968

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
opencl : support k-quants (ggml-org#1836)

* Porting q2_k kernel to OpenCL

* Set global and local sizes for kernel calls for dequantizing k-quants

* Added q6_k kernel

* Fix q4_k opencl struct order

* Replace uchar with uint8_t

* Finish dequant kernels

* Added OpenCL DMMV kernels

* Fix q2_k, improve code

* Fix q3_k

* Shorten switch statements

* Improve code formatting

---------

Co-authored-by: Concedo <[email protected]>

master-b41b4ca

examples : add "simple" (ggml-org#1840)

* Create `simple.cpp`

* minimalist example `CMakeLists.txt`

* Update Makefile for minimalist example

* remove 273: Trailing whitespace

* removed trailing white spaces simple.cpp

* typo and comments simple.cpp

---------

Co-authored-by: Georgi Gerganov <[email protected]>

master-ac3b886

llama : fix embd when offloading non-repeating layers (ggml-org#1891)

master-13fe9d2

cmake : add auto detection of BLAS_INCLUDE_DIRS (ggml-org#1886)

master-9cbf50c

build : fix and ignore MSVC warnings (ggml-org#1889)

master-5b9ccaf

Fixed possible macro redefinition (ggml-org#1892)

MinGW libstdc++ may define `NOMINMAX` unconditionally. This fixes the case when it is already defined.
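The patch itself is not shown on this page. A common way to avoid this kind of redefinition is an `#ifndef` guard, sketched below. The first `#define` only simulates a toolchain that has already defined the macro, and `nominmax_defined` is a hypothetical helper added for illustration.

```cpp
// Simulate a toolchain or header that already defines NOMINMAX
// (assumption for illustration, e.g. equivalent to passing -DNOMINMAX).
#define NOMINMAX 1

// Guarded definition: define the macro only when nothing else has,
// so the compiler never sees a conflicting redefinition.
#ifndef NOMINMAX
#define NOMINMAX
#endif

// Hypothetical helper: reports whether NOMINMAX is visible at this point.
constexpr bool nominmax_defined() {
#ifdef NOMINMAX
    return true;
#else
    return false;
#endif
}
```

With the guard in place, the macro ends up defined whether or not the toolchain supplied it first, and no redefinition warning is triggered.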

master-3d01122

CUDA : faster k-quant dot kernels (ggml-org#1862)

* cuda : faster k-quant dot kernels

* Improve Q2_K dot kernel on older GPUs

We now have a K_QUANTS_PER_ITERATION macro, which should be
set to 1 on older and to 2 on newer GPUs.
With this, we preserve the performance of the original
PR on RTX-4080, and are faster compared to master on
GTX-1660.

* Improve Q6_K dot kernel on older GPUs

Using the same K_QUANTS_PER_ITERATION macro as last commit,
we preserve performance on RTX-4080 and speed up
Q6_K on a GTX-1660.

* Add LLAMA_CUDA_KQUANTS_ITER to CMakeLists.txt and Makefile

Allowed values are 1 or 2. 2 gives the best performance on
modern GPUs and is set as default. On older GPUs 1 may work
better.

* PR comments

---------

Co-authored-by: Iwan Kawrakow <[email protected]>
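The commit message above names a `K_QUANTS_PER_ITERATION` macro and a `LLAMA_CUDA_KQUANTS_ITER` build option with allowed values 1 and 2, defaulting to 2. How the build option reaches the macro is not shown here, so the following is only a sketch under the assumption that the build system passes the option as a preprocessor definition of the same name. `k_quants_per_iteration` is a hypothetical helper.

```cpp
// Hypothetical sketch, not the actual llama.cpp source.
// Assumption: the build system passes the option as a preprocessor
// definition, e.g. -DLLAMA_CUDA_KQUANTS_ITER=1.
#ifdef LLAMA_CUDA_KQUANTS_ITER
#define K_QUANTS_PER_ITERATION LLAMA_CUDA_KQUANTS_ITER
#else
#define K_QUANTS_PER_ITERATION 2  // default stated in the commit message
#endif

// Reject any value other than the two allowed ones at compile time.
static_assert(K_QUANTS_PER_ITERATION == 1 || K_QUANTS_PER_ITERATION == 2,
              "K_QUANTS_PER_ITERATION must be 1 or 2");

// Hypothetical helper exposing the selected value to other code.
constexpr int k_quants_per_iteration() { return K_QUANTS_PER_ITERATION; }
```

Compiled without the definition, the helper returns 2. According to the commit message, 1 may work better on older GPUs such as the GTX-1660.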

master-cf267d1

make : add train-text-from-scratch (ggml-org#1850)

* make finetuning example accessible

* fixed: target was in wrong line

* fixed: name of executable was wrong

* fixed: naming of binary

* fixed: model path was wrong

* fixed clean target

* Update examples/train-text-from-scratch/README.md

---------

Co-authored-by: Georgi Gerganov <[email protected]>

master-c36e81d

examples : add chat-vicuna.sh (ggml-org#1854)

Co-authored-by: Yang Li <[email protected]>

master-bed9275

Verified

This commit was signed with the committer’s verified signature.
ggerganov Georgi Gerganov
cmake : remove whitespaces