Conversation

@slaren (Member) commented Jul 31, 2025

This is intended to be a simple, curated way to keep the MoE weights on the CPU. Internally, it just sets up the appropriate tensor overrides, but it should be easier to use.
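
As a sketch of the intended usage (the flag name `--cpu-moe` and the expert-tensor regex below are assumptions based on this PR's description, not verified against the merged code):

```sh
# Curated option added by this PR: keep the MoE expert weights on the CPU
./llama-cli -m model.gguf -ngl 99 --cpu-moe

# Roughly equivalent manual form using tensor overrides; the regex is an
# assumption based on the conventional blk.N.ffn_{up,down,gate}_exps names
./llama-cli -m model.gguf -ngl 99 --override-tensor "\.ffn_(up|down|gate)_exps=CPU"
```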

@slaren merged commit a06ed5f into master on Jul 31, 2025 (47 checks passed)
@slaren deleted the sl/moe-switch branch on Jul 31, 2025 at 18:15
@jacekpoplawski (Contributor)

Am I correct that this is on/off only? It would be better to have an option for the number of layers (similar to -ngl).

@slaren (Member, Author) commented Jul 31, 2025

I am not convinced that it would be worth it. The goal here is to have a very simple option that works well enough for most people. If you want to min-max, you can still use the --override-tensor option to customize it in any way you want.
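
For example, a hypothetical min-max setup that keeps only the experts of layers 0-19 on the CPU and offloads the rest (the layer range and regex are illustrative, not taken from this PR):

```sh
# Pin the expert tensors of layers 0-19 to the CPU; everything else
# follows -ngl as usual (layer range chosen purely for illustration)
./llama-cli -m model.gguf -ngl 99 \
  --override-tensor "blk\.(1?[0-9])\.ffn_(up|down|gate)_exps=CPU"
```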

@jacekpoplawski (Contributor)

Yes, I understand. And now I have an idea for my experiments :)
