
Commit 23c10e8

Author: Ralf Waldukat
Update to llama.cpp 2026-01-01
- Update llama.cpp submodule (2025-08-14 → 2026-01-01)
- Remove deprecated KV cache functions (use llama_memory_* instead)
- Remove llama_sampler_init_softmax (deprecated)
- Add LLAMA_ROPE_TYPE_IMROPE constant
- Add llama_flash_attn_type enum (AUTO/DISABLED/ENABLED)
- Add llama_params_fit_status enum
- Add llama_model_meta_key enum for sampling metadata
- Add llama_model_params fields: no_host, no_alloc
- Replace llama_context_params.flash_attn bool with flash_attn_type enum
- Add 15 new API functions:
  - llama_max_tensor_buft_overrides
  - llama_n_ctx_seq
  - llama_model_n_embd_inp
  - llama_model_is_hybrid
  - llama_flash_attn_type_name
  - llama_model_meta_key_str
  - llama_adapter_meta_* functions (5)
  - llama_log_get/set
  - llama_memory_breakdown_print
- Add ggml_log_callback typedef
- Disable LLAVA build (CMake incompatibility in upstream mtmd)
- Bump version 0.3.16 → 0.4.0

Breaking changes:
- flash_attn bool removed, use flash_attn_type enum
- KV cache functions removed, use llama_memory_* API

Tested with Nemotron-3-Nano-30B hybrid model.
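
For callers that still carry a boolean flash_attn setting, the first breaking change can be handled with a small mapping step. The following is a minimal sketch, not code from this commit: `FlashAttnType` and `flash_attn_type_from_legacy` are hypothetical names, and the integer values (AUTO = -1, DISABLED = 0, ENABLED = 1) are assumed to mirror the upstream llama_flash_attn_type enum, so check them against the vendored llama.h before relying on them.

```python
from enum import IntEnum
from typing import Optional


class FlashAttnType(IntEnum):
    """Hypothetical stand-in for the llama_flash_attn_type enum.

    The values are an assumption based on upstream llama.h; verify them
    against the header shipped in vendor/llama.cpp.
    """

    AUTO = -1
    DISABLED = 0
    ENABLED = 1


def flash_attn_type_from_legacy(flash_attn: Optional[bool] = None) -> FlashAttnType:
    """Map the removed boolean flag onto the enum (hypothetical helper).

    None means the caller never set the flag, so the choice is left to AUTO.
    """
    if flash_attn is None:
        return FlashAttnType.AUTO
    return FlashAttnType.ENABLED if flash_attn else FlashAttnType.DISABLED
```

Treating an unset flag as AUTO rather than DISABLED is a design choice of this sketch; code that depended on the old default of "off" may prefer to map None to DISABLED instead.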
1 parent c37132b commit 23c10e8

4 files changed, +164 -259 lines changed


CMakeLists.txt

Lines changed: 6 additions & 0 deletions
@@ -153,6 +153,12 @@ if (LLAMA_BUILD)
 add_compile_definitions(GGML_USE_METAL)
 endif()
 
+# Set version for mtmd (required by upstream CMakeLists.txt)
+if (NOT DEFINED LLAMA_BUILD_NUMBER)
+set(LLAMA_BUILD_NUMBER 0)
+endif()
+set(LLAMA_INSTALL_VERSION 0.0.${LLAMA_BUILD_NUMBER})
+
 # Building llava
 add_subdirectory(vendor/llama.cpp/tools/mtmd)
 

llama_cpp/__init__.py

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
 from .llama_cpp import *
 from .llama import *
 
-__version__ = "0.3.16"
+__version__ = "0.4.0"
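
Because 0.4.0 removes the flash_attn bool and the KV cache functions, downstream code that must run against both releases may want to branch on the installed version. The sketch below is one way to do that, under the assumption that the version string is plain dotted integers like the two values in this diff; `parse_version` and `uses_flash_attn_type` are hypothetical helpers, not part of the package.

```python
from typing import Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    """Turn a dotted version string such as "0.4.0" into a tuple of ints.

    Assumes purely numeric components; suffixes like "rc1" are not handled.
    """
    return tuple(int(part) for part in version.split("."))


def uses_flash_attn_type(version: str) -> bool:
    """Return True for releases at or after the 0.4.0 API break."""
    return parse_version(version) >= (0, 4, 0)
```

Comparing integer tuples rather than raw strings matters here: as strings, "0.3.16" and "0.4.0" happen to order correctly, but "0.10.0" would sort before "0.4.0".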
