b5350

mtmd : Use RMS norm for InternVL 3 38B and 78B mmproj (ggml-org#13459)

May 11, 2025
c104023

b4696

HIP: Switch to std::vector in rocblas version check (ggml-org#11820)

Feb 12, 2025
e598697

b4476

server : (UI) Improve messages bubble shape in RTL (ggml-org#11220)

I had simply overlooked the message bubble's tail placement for RTL
text because I use dark mode, where the tail isn't visible. This
commit fixes it.

Jan 13, 2025
504af20

b4457

llama: add support for QRWKV6 model architecture (ggml-org#11001)

* WIP: Add support for RWKV6Qwen2

Signed-off-by: Molly Sophia <[email protected]>

* RWKV: Some graph simplification

Signed-off-by: Molly Sophia <[email protected]>

* Add support for RWKV6Qwen2 with cpu and cuda GLA

Signed-off-by: Molly Sophia <[email protected]>

* RWKV6[QWEN2]: Concat lerp weights together to reduce cpu overhead

Signed-off-by: Molly Sophia <[email protected]>

* Fix some typos

Signed-off-by: Molly Sophia <[email protected]>

* code format changes

Signed-off-by: Molly Sophia <[email protected]>

* Fix wkv test & add gla test

Signed-off-by: Molly Sophia <[email protected]>

* Fix cuda warning

Signed-off-by: Molly Sophia <[email protected]>

* Update README.md

Signed-off-by: Molly Sophia <[email protected]>

* Update ggml/src/ggml-cuda/gla.cu

Co-authored-by: Georgi Gerganov <[email protected]>

* Fix fused lerp weights loading with RWKV6

Signed-off-by: Molly Sophia <[email protected]>

* better sanity check skipping for QRWKV6 in llama-quant

thanks @compilade

Signed-off-by: Molly Sophia <[email protected]>
Co-authored-by: compilade <[email protected]>

---------

Signed-off-by: Molly Sophia <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>
Co-authored-by: compilade <[email protected]>

Jan 10, 2025
ee7136c

b4447

ci : use actions from ggml-org (ggml-org#11140)

Jan 8, 2025
f7cd133

b4444

sync : ggml

Jan 8, 2025
99a3755

b4431

llama-run : fix context size (ggml-org#11094)

Set `n_ctx` equal to `n_batch` in `Opt` class. Now context size is
a more reasonable 2048.

Signed-off-by: Eric Curtin <[email protected]>

Jan 6, 2025
dc7cef9

b4311

common : add missing env var for speculative (ggml-org#10801)

Dec 12, 2024
9fdb124

b4306

Update README.md (ggml-org#10772)

Dec 11, 2024
1a31d0d

b4295

CUDA: fix shared memory access condition for mmv (ggml-org#10740)

Dec 9, 2024
26a8406

Tags: VJHack/llama.cpp