Codestin Search App

30 Apr 14:21

77e15be

b2773 Latest

Latest

metal : remove deprecated error code (#7008)

Assets 19

cudart-llama-bin-win-cu11.7.1-x64.zip

293 MB 2024-04-30T14:21:41Z
cudart-llama-bin-win-cu12.2.0-x64.zip

413 MB 2024-04-30T14:21:51Z
llama-b2773-bin-macos-arm64.zip

39.1 MB 2024-04-30T14:21:58Z
llama-b2773-bin-macos-x64.zip

35.7 MB 2024-04-30T14:21:59Z
llama-b2773-bin-ubuntu-x64.zip

44.1 MB 2024-04-30T14:22:00Z
llama-b2773-bin-win-arm64-x64.zip

5.85 MB 2024-04-30T14:22:02Z
llama-b2773-bin-win-avx-x64.zip

6.37 MB 2024-04-30T14:22:02Z
llama-b2773-bin-win-avx2-x64.zip

6.36 MB 2024-04-30T14:22:03Z
llama-b2773-bin-win-avx512-x64.zip

6.38 MB 2024-04-30T14:22:03Z
llama-b2773-bin-win-clblast-x64.zip

7.56 MB 2024-04-30T14:22:04Z
Source code (zip)

2024-04-30T12:52:21Z
Source code (tar.gz)

2024-04-30T12:52:21Z

30 Apr 06:17

github-actions

b2769

8843a98

b2769

Improve usability of --model-url & related flags (#6930)

* args: default --model to models/ + filename from --model-url or --hf-file (or else legacy models/7B/ggml-model-f16.gguf)

* args: main & server now call gpt_params_handle_model_default

* args: define DEFAULT_MODEL_PATH + update cli docs

* curl: check url of previous download (.json metadata w/ url, etag & lastModified)

* args: fix update to quantize-stats.cpp

* curl: support legacy .etag / .lastModified companion files

* curl: rm legacy .etag file support

* curl: reuse regex across headers callback calls

* curl: unique_ptr to manage lifecycle of curl & outfile

* curl: nit: no need for multiline regex flag

* curl: update failed test (model file collision) + gitignore *.gguf.json

Assets 19

29 Apr 05:15

github-actions

b2755

e00b4a8

b2755

Fix more int overflow during quant (PPL/CUDA). (#6563)

* Fix more int overflow during quant.

* Fix some more int overflow in softmax.

* Revert back to int64_t.

Assets 19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Releases: sroecker/llama.cpp

b2773

b2769

b2755