Codestin Search App

inforithmics · 2026-03-01T11:01:10Z

Vulkan Use Mmap (disable Vulkan MMAp opt out)

Draft until new Vendor sync because in newer llama.cpp Versions mmap works without problems in Vulkan (It shows that it uses more memory but in reality it doesn't) It shows more used memory in Process and gpu but in effect there is more free memory.

The reason it seems that the Memory is duplicated is that the Shared Memory used in the GPU is also reported to be used by ollama (But it is the same memory) So for example 6GB Shared Memory used Ollama although "uses" 6GB Memory. This hapens on iGPU where the shared Memory is used for GPU offload.

Vulkan Use Mmap

da8659a

inforithmics marked this pull request as draft March 1, 2026 11:01

inforithmics changed the title ~~Vulkan Use Mmap (disable Vulkan MMAp opt out)~~ Vulkan Use Mmap (disable Vulkan MMap opt out) Mar 1, 2026

inforithmics mentioned this pull request Mar 1, 2026

Revert revert vendor update (Vendor Update to b8187) #14134

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vulkan Use Mmap (disable Vulkan MMap opt out)#14525

Vulkan Use Mmap (disable Vulkan MMap opt out)#14525
inforithmics wants to merge 1 commit intoollama:mainfrom
inforithmics:VulkanUseMMap

inforithmics commented Mar 1, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

inforithmics commented Mar 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

inforithmics commented Mar 1, 2026 •

edited

Loading